Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

🌐 Website | 🤗 Data | 📚 Paper

What's New

[2025.09.30] 📣 ITTO is released.
[2025.09.18] 🥳 ITTO accepted at NeurIPS 2025!

Introduction

ITTO is a challenging new benchmark suite for evaluating and diagnosing tracking point tracking methods. Our videos are sourced from existing datasets and egocentric real-world recordings, with high-quality human annotations collected through a multi-stage pipeline. ITTO captures the motion complexity, occlusion patterns, and object diversity characteristic of real-world scenes.

We provide:

Benchmark Dataset: A long-range and real-world tracking benchmark with novel motion complexity to evaluate state-of-the-art tracking methods.
Evaluation: A rigorous evaluation protocol along defined axes of motion complexity.
Annotation Pipeline: A plug-and-play annotation pipeline for collecting high-quality video track annotations.

Dataset Setup

Clone the dataset repository:

   git clone https://huggingface.co/datasets/your-username/your-dataset
   cd your-dataset

Follow the download instructions in the HuggingFace repo to get the full dataset
Then use our dataloaders and evaluation scripts:

Model Evaluation

We provide dataloaders for loading ITTO annotations. You can test your dataset installation by running our example dataloader script:

python dataloaders/check_dataloaders.py

We provide model evaluation scripts that were used to produce the numbers reported in the paper in the evaluation_scripts folder, which also contains per-model installation instructions.

See evaluation_scripts.md for instructions on how to run model evals. We provide separate, instructions for each of the models we evaluate in the ITTO paper:

CoTracker3: evaluation_scripts/cotracker
DELTA: evaluation_scripts/delta
LocoTracker: evaluation_scripts/locotracker
SceneTracker: evaluation_scripts/scenetracker
SpatialTracker: evaluation_scripts/spatialtracker
TapNet/TapNext/BootsTAP/BootsTAPIR: evaluation_scripts/tapnet
TapTr: evaluation_scripts/taptr

Acknowledgements

Our model evals are based on CoTracker3, DELTA, LocoTracker, SceneTracker, SpatialTracker, TapNet, and TapTr. We thank the authors for their excellent work!

License

This code is released for academic and research purposes only under CC BY-NC 4.0. Commercial use is prohibited due to dependencies on non-commercial third-party code. See LICENSE.md for full details.

Citing ITTO

@inproceedings{Demler_2025_Neurips,
    author    = {Demler, Ilona and Chauhan, Saumya and Gkioxari, Georgia},
    title     = {Is This Tracker On? A Benchmark Protocol for Dynamic Tracking},
    booktitle = {39th Conference on Neural Information Processing Systems (NeurIPS 2025) Track on Datasets and Benchmarks},
    month     = {December},
    year      = {2025},
    pages     = {19446-19455}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
dataloaders		dataloaders
evaluation_scripts		evaluation_scripts
.gitignore		.gitignore
CITATIONS.md		CITATIONS.md
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

What's New

Introduction

Dataset Setup

Model Evaluation

Acknowledgements

License

Citing ITTO

About

Uh oh!

Releases

Packages

Languages

License

ilonadem/itto

Folders and files

Latest commit

History

Repository files navigation

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

What's New

Introduction

Dataset Setup

Model Evaluation

Acknowledgements

License

Citing ITTO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages