
Model Card for ViTMix-v1

This model is a barely functional demo of using mixture-of-experts (MoE) layers in computer vision.

Model Details

Model Description

This model is meant to serve more as a blueprint than as a base. It has been trained on fashionmnist to prove that I can do tensor maths. It achieves an average loss of roughly 0.4.
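For readers new to the idea, here is a minimal sketch of top-k MoE routing for a single token (in a ViT, one image-patch embedding). It is not the code from this repo. The sizes, the helper names (`make_linear`, `moe_forward`), the single-linear-layer experts, and the choice to softmax over only the selected gate logits are all assumptions made for illustration, and it uses plain Python rather than a tensor library.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def make_linear(rng, n_in, n_out):
    """Hypothetical helper: a small random linear layer as (weights, bias)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    bias = [0.0] * n_out
    return weights, bias


def linear(layer, x):
    """Apply a (weights, bias) layer to the vector x."""
    weights, bias = layer
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, bias)]


def moe_forward(x, gate, experts, top_k=2):
    """Route one token through its top_k experts and mix their outputs.

    Assumption: each expert is a single linear layer; real experts are
    usually small MLPs.
    """
    gate_logits = linear(gate, x)  # one score per expert
    # Indices of the top_k highest-scoring experts.
    top = sorted(range(len(experts)), key=lambda i: gate_logits[i], reverse=True)[:top_k]
    # Renormalise the scores of the chosen experts so they sum to 1.
    weights = softmax([gate_logits[i] for i in top])
    out = [0.0] * len(x)
    for w, i in zip(weights, top):
        y = linear(experts[i], x)
        out = [o + w * yi for o, yi in zip(out, y)]
    return out, top, weights


rng = random.Random(0)
dim, num_experts = 8, 4  # illustrative sizes, not the model's real config
gate = make_linear(rng, dim, num_experts)
experts = [make_linear(rng, dim, dim) for _ in range(num_experts)]
token = [rng.uniform(-1.0, 1.0) for _ in range(dim)]

out, chosen, weights = moe_forward(token, gate, experts, top_k=2)
print("experts used:", chosen, "mixing weights:", [round(w, 3) for w in weights])
```

Only the selected experts are evaluated for each token, which is the main appeal of the approach: parameter count grows with the number of experts while per-token compute grows only with `top_k`.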

The code is in the repository files. Do what you want!

Model size: 391M params (Safetensors, tensor type F32)

Inference API (serverless) does not yet support model repos that contain custom code.

Dataset used to train SE6446/VitMix-v1