Model Card

Model Details

  • Architecture: ViT-Large with patch size 14
  • Training Data: SUN397 dataset

Training Details

Adam Optimizer with a constant learning rate 1e-5 for 4000 steps training (batch_size=32). Only the vision encoder is fine-tuned.

Evaluation Results

  • pre-trained: 0.6830110549926758
  • fine-tuned: 0.8275973796844482
Downloads last month
485
Safetensors
Model size
303M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for tanganke/clip-vit-large-patch14_sun397

Finetuned
(51)
this model

Dataset used to train tanganke/clip-vit-large-patch14_sun397

Collection including tanganke/clip-vit-large-patch14_sun397