metadata
license: agpl-3.0
pipeline_tag: zero-shot-object-detection
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
Model Description
YOLOE: Real-Time Seeing Anything
- arXiv: https://arxiv.org/abs/2503.07465
- github: https://github.com/THU-MIG/yoloe
Installation
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/CLIP
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/ml-mobileclip
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/lvis-api
pip install git+https://github.com/THU-MIG/yoloe.git
wget https://docs-assets.developer.apple.com/ml-research/datasets/mobileclip/mobileclip_blt.pt
Prediction
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#prediction.
Transferring
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#transferring.
Validation
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#validation.
Training
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#training.
BibTeX Entry and Citation Info
@misc{wang2025yoloerealtimeseeing,
title={YOLOE: Real-Time Seeing Anything},
author={Ao Wang and Lihao Liu and Hui Chen and Zijia Lin and Jungong Han and Guiguang Ding},
year={2025},
eprint={2503.07465},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.07465},
}