MobileCLIP2 - a apple Collection

apple 's Collections

Core ML Gallery Models

OpenELM Instruct Models

OpenELM Pretrained Models

MobileCLIP Models + DataCompDR Data

DepthPro Models

Core ML Stable Diffusion

Core ML FastViT

Core ML Depth Anything

DFN Models + Data

Core ML Segment Anything 2

MobileCLIP2

updated 1 day ago

MobileCLIP2: Mobile-friendly image-text models with SOTA zero-shot capabilities trained on DFNDR-2B

MobileCLIP2: Improving Multi-Modal Reinforced Training

Paper • 2508.20691 • Published Aug 28, 2025 • 7
apple/MobileCLIP2-S0

Updated Oct 9, 2025 • 150 • 47
apple/MobileCLIP2-S2

Updated Oct 9, 2025 • 73 • 15
apple/MobileCLIP2-B

Updated Oct 9, 2025 • 43 • 3
apple/MobileCLIP2-S3

Updated Oct 9, 2025 • 40 • 5
apple/MobileCLIP2-S4

Updated Oct 9, 2025 • 56 • 14
apple/MobileCLIP2-L-14

Updated Oct 9, 2025 • 36 • 4

Note Timm ViT-L/14 architecture trained on DFNDR-2B (dataset of MobileCLIP2)
apple/MobileCLIP-S3

Updated 10 days ago • 59 • 5

Note New architecture introduced in MobileCLIP2 paper but pretrained on DataCompDR (dataset of MobileCLIP v1)
apple/MobileCLIP-S4

Updated 10 days ago • 77 • 9

Note New architecture introduced in MobileCLIP2 paper but pretrained on DataCompDR (dataset of MobileCLIP v1)
apple/MobileCLIP-L-14

Updated Oct 9, 2025 • 29 • 1

Note Timm ViT-L/14 architecture pretrained on DataCompDR (dataset of MobileCLIP v1)
timm/MobileCLIP2-S0-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 3.94k • 1

Note 👇Timm checkpoints
timm/MobileCLIP2-S2-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 2.69k • 4
timm/MobileCLIP2-B-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 788 • 1
timm/MobileCLIP2-S3-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 9.62k • 3
timm/MobileCLIP2-S4-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 667 • 2
timm/MobileCLIP2-L-14-OpenCLIP

Zero-Shot Image Classification • Updated Sep 11, 2025 • 296 • 2
apple/mobileclip2_coca_dfn2b_s13b_mscoco38k_s12m_context77

Updated Oct 9, 2025 • 24 • 1

Note 👇MobileCLIP2 CoCa models for synthetic caption generation used to train MobileCLIP2 models
apple/mobileclip2_coca_dfn2b_s13b_gbc1m-short_context77

Updated Oct 9, 2025 • 19 • 1
apple/mobileclip2_coca_dfn2b_s13b_docci_s12m_context77

Updated Oct 9, 2025 • 23 • 1
apple/mobileclip2_coca_dfn2b_s13b_dci-short_s12m_context77

Updated Oct 9, 2025 • 21 • 1
apple/mobileclip2_coca_dfn2b_s13b_dci-extended_s12m_context77

Updated Oct 9, 2025 • 25 • 1
apple/mobileclip2_coca_dfn2b_s13b_dci-complete_s12m_context77

Updated Oct 9, 2025 • 21 • 1
apple/mobileclip2_coca_dfn2b_s13b_recap-coco-30k_s12m_context77

Updated Oct 9, 2025 • 21 • 1
apple/mobileclip2_coca_dfn2b_s13b_docci_s12m_context256

Updated Oct 9, 2025 • 19 • 1

Note 👇MobileCLIP2 CoCa models (context length=256). Higher chance of generating repeated output.
apple/mobileclip2_coca_dfn2b_s13b_dci-complete_s12m_context256

Updated Oct 9, 2025 • 23 • 1
apple/mobileclip2_coca_dfn2b_s13b_dci-extended_s12m_context256

Updated Oct 9, 2025 • 21 • 1
apple/mobileclip2_coca_dfn2b_s13b_context77

Updated Oct 9, 2025 • 25 • 1

Note MobileCLIP2 CoCa base model. It can be used for fine-tuning new CoCa models on high quality datasets.
apple/DFNDR-12M

Viewer • Updated 3 days ago • 12.8M • 75 • 4

Note 👇DFNDR: MobileCLIP2 Pretraining datasets
apple/DFNDR-12M-bf16

Viewer • Updated 3 days ago • 12.8M • 144 • 2
apple/DFNDR-2B

Updated 3 days ago • 220 • 1