Going Denser with Open-Vocabulary Part Segmentation
Object detection has been expanded from a limited number of categories to open vocabulary.
Moving forward, a complete intelligent vision system requires understanding finer-grained object descriptions, namely object parts.
In this work, we propose a detector with the ability to predict both open-vocabulary objects and their part segmentation.
This ability comes from two designs:
(1) We train the detector jointly on part-level, object-level, and image-level data.
(2) We parse a novel object into its parts via its dense semantic correspondence with a base object, as sketched below.
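The part parsing in (2) can be pictured as a nearest-neighbor label transfer over dense features. The following is a minimal, hypothetical sketch, not the repository's actual API: it assumes dense per-pixel features (e.g. DINO ViT features, as in dino-vit-features) have already been extracted for a base object with known part masks and for a novel object, and it transfers part labels through cosine-similarity correspondence.

```python
# Hypothetical sketch of design (2): label the parts of a novel object by
# dense semantic correspondence with a base object whose part masks are known.
# Feature extraction and the actual VLPart pipeline are abstracted away;
# all names below are illustrative.
import torch
import torch.nn.functional as F

def parse_novel_parts(novel_feats, base_feats, base_part_masks):
    """Assign a part index to every novel-object pixel.

    novel_feats:     (C, Hn, Wn) dense features of the novel object
    base_feats:      (C, Hb, Wb) dense features of the base object
    base_part_masks: (P, Hb, Wb) binary masks of the base object's P parts
    returns:         (Hn, Wn) tensor of part indices for the novel object
    """
    C, Hn, Wn = novel_feats.shape
    # Flatten spatial dims and L2-normalize so dot products are cosine similarities.
    novel = F.normalize(novel_feats.reshape(C, -1), dim=0)   # (C, Hn*Wn)
    base = F.normalize(base_feats.reshape(C, -1), dim=0)     # (C, Hb*Wb)
    # Dense correspondence: for each novel pixel, find its most similar base pixel.
    sim = novel.t() @ base                                    # (Hn*Wn, Hb*Wb)
    nearest_base = sim.argmax(dim=1)                          # (Hn*Wn,)
    # Per base pixel, take the part whose mask covers it (assumes masks tile the object).
    base_labels = base_part_masks.reshape(base_part_masks.shape[0], -1).argmax(dim=0)
    # Propagate each base pixel's part label to its corresponding novel pixel.
    return base_labels[nearest_base].reshape(Hn, Wn)
```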
We provide a large set of baseline results and trained models in the Model Zoo.
License
The majority of this project is licensed under the MIT License. Portions of the project are available under the separate licenses of the referenced projects, including CLIP, Detic and dino-vit-features. Many thanks for their wonderful work.
Citation
If you use VLPart in your research or wish to refer to the baseline results published here, please use the following BibTeX entry:
@article{peize2023vlpart,
title = {Going Denser with Open-Vocabulary Part Segmentation},
author = {Sun, Peize and Chen, Shoufa and Zhu, Chenchen and Xiao, Fanyi and Luo, Ping and Xie, Saining and Yan, Zhicheng},
journal = {arXiv preprint arXiv:2305.11173},
year = {2023}
}