Colmap Dataset#

The NCore COLMAP tool converts COLMAP reconstructions into the NCore V4 format.

Conventions#

COLMAP’s data format represents an arbitrary number of camera frames with associated poses, but carries no timestamp information. Because NCore is designed for applications with timestamped data, the converter assigns logical timestamps to images at a rate of 1 FPS (one second apart), starting from a configurable start time (--start-time-sec, default 0.0).
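The timestamp rule above can be sketched as follows. This is a minimal illustration, not the converter's code: the function name, the use of float seconds, and the ordering of frames by image name are all assumptions made for the example.

```python
def logical_timestamps_sec(num_frames: int, start_time_sec: float = 0.0, fps: float = 1.0) -> list[float]:
    """Return one logical timestamp (in seconds) per frame, spaced 1/fps apart."""
    if num_frames < 0:
        raise ValueError("num_frames must be non-negative")
    if fps <= 0:
        raise ValueError("fps must be positive")
    return [start_time_sec + index / fps for index in range(num_frames)]


# Assumption: frames are ordered by image name before timestamps are assigned.
image_names = sorted(["b.jpg", "a.jpg", "c.jpg"])
timestamps = dict(zip(image_names, logical_timestamps_sec(len(image_names), start_time_sec=10.0)))
```

With a start time of 10 seconds, the three frames receive timestamps one second apart, beginning at 10.0.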

Camera Sensors#

COLMAP cameras are identified by integer IDs. The converter maps them to NCore sensor IDs using a configurable prefix (default: camera). Downsampled image directories (images_2, images_4, images_8) can be included as additional camera sensor instances, each with its own scaled intrinsics.

Example sensor IDs:

camera1 — full-resolution images from COLMAP camera 1
camera1_2 — 2× downsampled images from camera 1
camera1_4 — 4× downsampled images from camera 1
camera2 — full-resolution images from COLMAP camera 2
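The naming scheme in the examples above can be reproduced with a small helper. This is a sketch only: `sensor_id` and `scale_intrinsics` are hypothetical names, and dividing the focal lengths and principal point by the downsample factor is an assumed scaling rule, not necessarily what the converter does.

```python
def sensor_id(camera_id: int, prefix: str = "camera", factor: int = 1) -> str:
    """Build a sensor ID such as 'camera1' or 'camera1_2' (factor 1 = full resolution)."""
    base = f"{prefix}{camera_id}"
    return base if factor == 1 else f"{base}_{factor}"


def scale_intrinsics(fx: float, fy: float, cx: float, cy: float, factor: int) -> tuple[float, float, float, float]:
    """Assumed rule: divide focal lengths and principal point by the downsample factor."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    return (fx / factor, fy / factor, cx / factor, cy / factor)


ids = [sensor_id(1), sensor_id(1, factor=2), sensor_id(1, factor=4), sensor_id(2)]
quarter_res = scale_intrinsics(1000.0, 1000.0, 640.0, 360.0, factor=4)
```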

Camera intrinsics are compatible with the OpenCVPinholeCameraModelParameters model for COLMAP camera types 0–4 (SIMPLE_PINHOLE, PINHOLE, SIMPLE_RADIAL, RADIAL, OPENCV), and OpenCVFisheyeCameraModelParameters for COLMAP camera type 5 (OPENCV_FISHEYE). COLMAP uses the same local camera convention as NCore:

Principal axis along the camera’s +z axis
x-axis points right, y-axis points down
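To illustrate how the six supported COLMAP models line up with the two OpenCV-style models, the sketch below normalizes COLMAP's per-model parameter lists into a common layout. The parameter orders are those of COLMAP's camera models; the function name and the returned dictionary are hypothetical and do not reflect the constructor of OpenCVPinholeCameraModelParameters or OpenCVFisheyeCameraModelParameters.

```python
# Number of distortion coefficients per COLMAP model ID.
_NUM_DISTORTION = {0: 0, 1: 0, 2: 1, 3: 2, 4: 4, 5: 4}


def to_opencv_style(model_id: int, params: list[float]) -> dict:
    """Normalize COLMAP camera parameters (model IDs 0-5) to an OpenCV-style layout."""
    if model_id in (0, 2, 3):
        # SIMPLE_PINHOLE, SIMPLE_RADIAL, RADIAL: f, cx, cy, then radial terms
        f, cx, cy, *dist = params
        fx = fy = f
    elif model_id in (1, 4, 5):
        # PINHOLE, OPENCV, OPENCV_FISHEYE: fx, fy, cx, cy, then distortion terms
        fx, fy, cx, cy, *dist = params
    else:
        raise ValueError(f"unsupported COLMAP camera model ID: {model_id}")
    if len(dist) != _NUM_DISTORTION[model_id]:
        raise ValueError("unexpected number of parameters for this model")
    # Pad to four coefficients: (k1, k2, p1, p2) for pinhole, (k1, k2, k3, k4) for fisheye.
    dist = list(dist) + [0.0] * (4 - len(dist))
    family = "fisheye" if model_id == 5 else "pinhole"
    return {"family": family, "fx": fx, "fy": fy, "cx": cx, "cy": cy, "distortion": dist}


simple_radial = to_opencv_style(2, [500.0, 320.0, 240.0, 0.1])
```

Models without tangential distortion simply carry zeros for the unused coefficients, which is why types 0 through 4 can share one pinhole model.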

Per-Image Masks#

The converter automatically detects per-image mask files and stores them as per-frame mask properties in the generic_data of each camera frame (grayscale uint8, shape [H, W]).

Three mask-file conventions are supported, checked in priority order:

1. Explicit masks directory: <sequence_dir>/<masks_dir>/<stem>.png, when --masks-dir is configured.
2. Co-located mask: <image_dir>/<stem>_mask.png, alongside the image file.
3. Separate masks directory: <sequence_dir>/masks/<image_filename>.

If no mask file is found for a given image, the frame is stored without a mask entry. The number of masks found per camera is logged at INFO level.
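The lookup order described above can be sketched as a first-match search over candidate paths. This is an illustrative approximation: `find_mask` is a hypothetical name, and the real converter may differ in details such as case handling or file-type checks.

```python
import tempfile
from pathlib import Path
from typing import Optional


def find_mask(sequence_dir: Path, image_path: Path, masks_dir: Optional[str] = None) -> Optional[Path]:
    """Return the first existing mask file for an image, or None if there is none."""
    candidates = []
    if masks_dir is not None:
        # 1. Explicit masks directory: <sequence_dir>/<masks_dir>/<stem>.png
        candidates.append(sequence_dir / masks_dir / f"{image_path.stem}.png")
    # 2. Co-located mask: <image_dir>/<stem>_mask.png
    candidates.append(image_path.with_name(f"{image_path.stem}_mask.png"))
    # 3. Separate masks directory: <sequence_dir>/masks/<image_filename>
    candidates.append(sequence_dir / "masks" / image_path.name)
    for candidate in candidates:
        if candidate.is_file():
            return candidate
    return None


# Small demonstration on a temporary directory tree.
with tempfile.TemporaryDirectory() as tmp:
    seq = Path(tmp)
    (seq / "images").mkdir()
    (seq / "masks").mkdir()
    image = seq / "images" / "frame_0001.jpg"
    image.touch()
    no_mask = find_mask(seq, image)
    (seq / "masks" / "frame_0001.jpg").touch()
    only_separate = find_mask(seq, image).relative_to(seq).as_posix()
    (seq / "images" / "frame_0001_mask.png").touch()  # higher priority than masks/
    with_colocated = find_mask(seq, image).relative_to(seq).as_posix()
```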

Point Clouds#

sfm_points — SfM 3D points from the COLMAP sparse reconstruction (optional)

If the COLMAP reconstruction contains 3D points (points3D.bin), they can be stored as a PointCloudsComponent named sfm_points in the world coordinate frame (enabled by default; see --include-3d-points). Each point carries its reconstructed RGB color as a typed per-point attribute. Coordinates are unitless because COLMAP reconstructions have arbitrary scale.

With the separate-sensors profile (default), the point clouds are stored in their own component group (ncore4-sfm_points.zarr).
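For readers unfamiliar with points3D.bin, the sketch below extracts positions and colors from it. It assumes the little-endian layout used by COLMAP's binary model files (a point count, then per point an ID, XYZ, RGB, reprojection error, and a track of image observations); verify it against the COLMAP version that produced your data. `read_points3d` is a hypothetical helper, not the reader used by the converter.

```python
import io
import struct
from typing import BinaryIO


def read_points3d(stream: BinaryIO) -> list[tuple[tuple[float, float, float], tuple[int, int, int]]]:
    """Read (xyz, rgb) pairs from a COLMAP points3D.bin stream."""
    (num_points,) = struct.unpack("<Q", stream.read(8))
    points = []
    for _ in range(num_points):
        # uint64 point ID, 3 x float64 position, 3 x uint8 color, float64 reprojection error
        _point_id, x, y, z, r, g, b, _error = struct.unpack("<QdddBBBd", stream.read(43))
        (track_length,) = struct.unpack("<Q", stream.read(8))
        # Skip the track: one (image_id, point2D_idx) pair of int32 values per observation.
        stream.seek(8 * track_length, 1)
        points.append(((x, y, z), (r, g, b)))
    return points


# Synthetic one-point file with a two-element track, used to exercise the reader.
blob = (
    struct.pack("<Q", 1)
    + struct.pack("<QdddBBBd", 7, 1.0, 2.0, 3.0, 255, 128, 0, 0.5)
    + struct.pack("<Q", 2)
    + struct.pack("<iiii", 1, 10, 2, 20)
)
points = read_points3d(io.BytesIO(blob))
```

For a real scene, open the file in binary mode and pass the file object to the same function.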

Conversion#

The converter uses NCore V4’s component-based architecture. Each COLMAP scene is parsed from a COLMAP reconstruction directory (default: sparse/0/) and written to NCore format via SequenceComponentGroupsWriter with specialized component writers for poses, intrinsics, cameras, and optionally a point clouds component.

Usage#

Run the converter with Bazel from the repository root:

bazel run //tools/data_converter/colmap:convert -- \
    --root-dir <PATH_TO_COLMAP_SCENE> \
    --output-dir <PATH_TO_OUTPUT> \
    colmap-v4

If --root-dir points to a parent directory containing multiple scenes, each subdirectory is treated as a separate sequence.

Base arguments (required):

| Argument | Description |
| --- | --- |
| `--root-dir PATH` | Path to a single COLMAP scene directory or a parent directory containing multiple scenes |
| `--output-dir PATH` | Path where converted NCore V4 sequences will be written |

Base arguments (optional):

| Argument | Description |
| --- | --- |
| `--no-cameras` | Disable exporting all camera sensors |
| `--camera-id ID` | Export only the specified camera (repeatable; defaults to all cameras) |
| `--no-lidars` | Disable exporting lidar sensors (not applicable to COLMAP; kept for base converter compatibility) |
| `--verbose` | Enable debug-level logging |

Subcommand arguments (colmap-v4):

| Argument | Default | Description |
| --- | --- | --- |
| `--store-type {itar,directory}` | `itar` | Output store format. `itar` produces an indexed tar archive; `directory` writes plain zarr directories |
| `--profile {default,separate-sensors,separate-all}` | `separate-sensors` | Component group layout. `default` groups all sensors together; `separate-sensors` gives each sensor its own group; `separate-all` splits every component type into its own group |
| `--sequence-meta` / `--no-sequence-meta` | enabled | Whether to write a JSON metadata file alongside each converted sequence |
| `--start-time-sec FLOAT` | `0.0` | Logical start time in seconds assigned to the first image frame |
| `--camera-prefix TEXT` | `camera` | Prefix prepended to COLMAP integer camera IDs to form NCore sensor IDs (e.g. `camera1`) |
| `--include-downsampled-images` / `--no-include-downsampled-images` | enabled | Include downsampled image directories (`images_2`, `images_4`, `images_8`) as additional camera sensors |
| `--include-3d-points` / `--no-include-3d-points` | enabled | Include the SfM point cloud as a `PointCloudsComponent` (`sfm_points`) |
| `--colmap-dir TEXT` | `sparse/0` | Relative path to the COLMAP reconstruction directory within each sequence |
| `--images-dir TEXT` | `images` | Relative path to the image directory within each sequence |
| `--masks-dir TEXT` | (auto-detect) | Explicit masks directory relative to each sequence root. When set, looks for `<masks_dir>/<stem>.png` |
| `--world-global-mode {none,identity}` | `none` | Controls whether a `("world", "world_global")` static pose is stored. `none` omits it (default); `identity` stores an identity matrix for downstream consumers that require it |

For the complete implementation, see tools/data_converter/colmap/converter.py.

API Reference#

V4 Components (ncore.data.v4):

SequenceComponentGroupsWriter - Main writer for V4 sequences
PosesComponent - Static and dynamic pose storage
IntrinsicsComponent - Camera and lidar intrinsics
PointCloudsComponent - Pre-computed point clouds
CameraSensorComponent - Camera frame data
CuboidsComponent - 3D cuboid track observations
MasksComponent - Camera masks

Data Converter (ncore.data_converter):

BaseDataConverter - Abstract base class for converters
BaseDataConverterConfig - Base configuration dataclass

Sensor Models (ncore.data):

OpenCVPinholeCameraModelParameters - Pinhole camera intrinsics model
OpenCVFisheyeCameraModelParameters - Fisheye camera intrinsics model

ScanNet++ Conversion#

The scannetpp-v4 subcommand converts ScanNet++ DSLR scenes using the resized fisheye images (dslr/resized_images/) with the COLMAP OPENCV_FISHEYE camera model. Train/test split metadata from train_test_lists.json is stored in the sequence-level generic_meta_data.

bazel run //tools/data_converter/colmap:convert -- \
    --root-dir /path/to/scannetpp/scene_id \
    --output-dir /path/to/output \
    scannetpp-v4

When --root-dir points to a parent directory containing multiple scenes, each subdirectory with a dslr/colmap/ directory is treated as a separate scene.
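The scene discovery rule above can be sketched as follows. `find_scannetpp_scenes` is a hypothetical name, and treating a root that itself contains dslr/colmap/ as a single scene is an assumption about how the two cases are told apart.

```python
import tempfile
from pathlib import Path


def find_scannetpp_scenes(root_dir: Path) -> list[Path]:
    """Return scene directories under root_dir that contain a dslr/colmap/ directory."""
    if (root_dir / "dslr" / "colmap").is_dir():
        # Assumption: the root itself is a single scene.
        return [root_dir]
    return sorted(
        child for child in root_dir.iterdir()
        if (child / "dslr" / "colmap").is_dir()
    )


# Demonstration: two valid scenes and one directory without COLMAP data.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    for name in ("scene_b", "scene_a"):
        (root / name / "dslr" / "colmap").mkdir(parents=True)
    (root / "not_a_scene").mkdir()
    scene_names = [scene.name for scene in find_scannetpp_scenes(root)]
    single_scene = [scene.name for scene in find_scannetpp_scenes(root / "scene_a")]
```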

Subcommand arguments (scannetpp-v4):

| Argument | Default | Description |
| --- | --- | --- |
| `--store-type {itar,directory}` | `itar` | Output store format |
| `--profile {default,separate-sensors,separate-all}` | `separate-sensors` | Component group layout |
| `--include-3d-points` / `--no-include-3d-points` | enabled | Include the COLMAP SfM point cloud as a `PointCloudsComponent` (`sfm_points`) |