You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This cross-view birds species dataset consists of paired ground-level bird images and satellite images, along with meta-information associated with the iNaturalist-2021 dataset.
Satellite images along with meta-information - Link
Clone the Remote-Sensing-RVSA repository inside BirdSAT:
cd BirdSAT
git clone https://github.com/ViTAE-Transformer/Remote-Sensing-RVSA.git
Append the code for CVMMAE present in utils_model/CVMMAE.py to the file present in Remote-Sensing-RVSA/MAEPretrain_SceneClassification/models_mae_vitae.py
Download pretrained satellite image encoder from - Link and place inside folder pretrained_models. You might get an error while loading this model. You need to set the option kernel=3 in the file Remote-Sensing-RVSA/MAEPretrain_SceneClassification/models_mae_vitae.py in the class MaskedAutoencoderViTAE.
Download all datasets, unzip them and place inside folder data.
Installing Required Packages
There are two options to setup your environment to be able to run all the functions in the repository:
Using Dockerfile provided in the repository to create a docker image with all required packages:
docker build -t <your-docker-hub-id>/birdsat .
Creating conda Environment with all required packages:
@inproceedings{sastry2024birdsat,
title={BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping},
author={Srikumar, Sastry and Subash, Khanal and Aayush, Dhakal and Huang, Di and Nathan, Jacobs},
booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
year={2024}
}
🔍 Additional Links
Check out our lab website for other interesting works on geospatial understanding and mapping;