---
license: agpl-3.0
pipeline_tag: zero-shot-object-detection
tags:
- model_hub_mixin
- pytorch_model_hub_mixin
---

### Model Description
[YOLOE: Real-Time Seeing Anything](https://arxiv.org/abs/2503.07465)

- arXiv: https://arxiv.org/abs/2503.07465
- github: https://github.com/THU-MIG/yoloe

### Installation
```
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/CLIP
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/ml-mobileclip
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/lvis-api
pip install git+https://github.com/THU-MIG/yoloe.git

wget https://docs-assets.developer.apple.com/ml-research/datasets/mobileclip/mobileclip_blt.pt
```

### Prediction
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#prediction.

### Transferring
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#transferring.

### Validation
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#validation.

### Training
Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#training.

### BibTeX Entry and Citation Info
```
@misc{wang2025yoloerealtimeseeing,
      title={YOLOE: Real-Time Seeing Anything},
      author={Ao Wang and Lihao Liu and Hui Chen and Zijia Lin and Jungong Han and Guiguang Ding},
      year={2025},
      eprint={2503.07465},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.07465},
}
```