Model Card

Model Details

  • Architecture: ViT-Large with patch size 14
  • Training Data: MNIST dataset

Training Details

Adam Optimizer with a constant learning rate 1e-5 for 4000 steps training (batch_size=32). Only the vision encoder is fine-tuned.

Evaluation Results

  • pre-trained: 0.7602328658103943
  • fine-tuned: 0.9975429177284241
Downloads last month
480
Safetensors
Model size
303M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for tanganke/clip-vit-large-patch14_mnist

Finetuned
(51)
this model

Dataset used to train tanganke/clip-vit-large-patch14_mnist

Collection including tanganke/clip-vit-large-patch14_mnist