Data2Vec-Vision (large-sized model, fine-tuned on ImageNet-1k)

BEiT model pre-trained in a self-supervised fashion and fine-tuned on ImageNet-1k (1.2 million images, 1,000 classes) at resolution 224x224. It was introduced in the paper "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" by Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, and Michael Auli, and was first released in this repository.

Usage

from mlx_ssl.models import Data2VecVisionForImageClassification

model = Data2VecVisionForImageClassification.from_pretrained(
    "mlx-community/data2vec-vision-large-ft1k"
)
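
The snippet above only loads the weights. As a rough sketch of the steps around it, the helpers below prepare a 224x224 input and read the top classes out of a 1,000-way logit vector. Everything here is an assumption rather than part of `mlx_ssl`: the helper names `preprocess` and `top_k` are hypothetical, the mean/std of 0.5 is an assumed normalization, the channels-last batch layout is assumed, and the nearest-neighbor resize is a stand-in for a proper image-processing library.

```python
import numpy as np

IMAGE_SIZE = 224  # resolution stated in the model description


def preprocess(image: np.ndarray, size: int = IMAGE_SIZE) -> np.ndarray:
    """Hypothetical helper: uint8 (H, W, 3) image -> float32 (1, size, size, 3)."""
    h, w, _ = image.shape
    # Nearest-neighbor resize via index selection (illustrative only).
    rows = np.arange(size) * h // size
    cols = np.arange(size) * w // size
    resized = image[rows][:, cols]
    scaled = resized.astype(np.float32) / 255.0
    # Assumed normalization: mean 0.5 and std 0.5 per channel.
    normalized = (scaled - 0.5) / 0.5
    # Add a batch dimension; channels-last layout is an assumption.
    return normalized[None, ...]


def top_k(logits: np.ndarray, k: int = 5) -> list[tuple[int, float]]:
    """Hypothetical helper: return (class_index, probability) for the k best classes."""
    logits = np.asarray(logits, dtype=np.float64).reshape(-1)
    shifted = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(shifted) / np.exp(shifted).sum()
    order = np.argsort(probs)[::-1][:k]
    return [(int(i), float(probs[i])) for i in order]
```

The array returned by `preprocess` would still need converting to an MLX array before being passed to the model, and the model's call signature and output format are not shown here because they are not documented in this card.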

Model size: 305M params (Safetensors; tensor types I64, F32)