Carview!

CARVIEW

MOTORHOMES

Select Language

CrossLoc Benchmark Datasets

large-scale benchmark datasets for sim-to-real visual localization,
including synthetic and real images with 3D and semantic labels on urban and natural sites.

50k+

Synthetic Images

7k+

Real Images

270

Hectares

6+

Modalities

We introduce two large-scale sim-to-real benchmark datasets to exemplify the utility of the TOPO-DataGen. Please visit the CrossLoc Benchmark Datasets page hosted by Drayd for download. See the GitHub repo for other dataset details and how to setup.
All 7k+ real images are accurately geo-referenced using a professional drone, numerous ground control points, and a GNSS base.
All real images have matching synthetic RGB images with a validated positioning accuracy at cm-level.
Most generated 3D labels have a reprojection error of no more than 2 pixels.
All synthetic data is created using the fully open-sourced geo-data from the Swiss government swisstopo. The semantic maps are also generated.
Carefully calibrated camera intrinsic parameters and the 6D camera poses are provided.

(Note: quality may be compromised due to GIF compression)

CrossLoc localization

A cross-modal visual representation learning method via self-supervision for absolute localization.

The CrossLoc learns to localize the query image by predicting its scene coordinates using a set of cross-modal encoders, followed by camera pose estimation using a PnP solver. Similar to self-supervised learning, it leverages data structure to create additional supervisory signals to enhance learning. Specifically, it makes use of the coordinate-depth-normal geometric hierarchy for self-supervision: from the 3D scene coordinate labels, one could compute accurate 6D camera pose and subsequently compute the depth and surface normals without any external labels. It's observed that the tasks of coordinate-depth-normal regression are geometrically highly related. The aggregated cross-modal representations could be used to enhance the final task of coordinate regression.

See the CrossLoc localization code for training code, pretrained models, and baseline implementations.

Paper

CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data
CVPR 2022

GitHub repositories:

Baseline implementation:

CVPR Camera Ready

BibTex

@article{yan2021crossloc,
        title={CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data},
        author={Yan, Qi and Zheng, Jianhao and Reding, Simon and Li, Shanci and Doytchinov, Iordan},
        journal={arXiv preprint arXiv:2112.09081},
        year={2021}
}

@misc{iordan2022crossloc, 
	title={CrossLoc Benchmark Datasets}, 
	author={Doytchinov, Iordan and Yan, Qi and Zheng, Jianhao and Reding, Simon and Li, Shanci}, 
	publisher={Dryad}, 
	doi={10.5061/DRYAD.MGQNK991C}, 
	url={https://datadryad.org/stash/dataset/doi:10.5061/dryad.mgqnk991c},
	year={2022}
}

Team

Qi Yan

EPFL

Jianhao Zheng

EPFL

Simon Reding

EPFL

Shanci Li

EPFL

Iordan Doytchinov

EPFL

Partner

Original Source | Taken Source

CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data

CVPR 2022

How to obtain Data and Model for
Absolute Visual Re-localization?

TOPO-DataGen

Datasets

CrossLoc

Abstract

TOPO-DataGen

CrossLoc Benchmark Datasets