• Model : https://huggingface.co/GeneZC/MiniChat-1.5-3B

  • FT : @teilomillet

  • Instruction tune using QLoRA on a french dataset for 1 epoch. The aim was to test and try the dataset. Implementing a customization via a dataset and fine-tuning on it. The way to respond is also important to see if it's taken from the dataset and add to the customization.

This is the first of a long serie of multiple models. Aimed to be minuscule as possible.

  • Batch : 6
  • Gradient step : 1
  • Epoch : 1
  • Lr : 0.0002
Downloads last month
1,050
Safetensors
Model size
3.02B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for teilomillet/MiniMerlin-3b-v0.1

Quantizations
1 model

Collection including teilomillet/MiniMerlin-3b-v0.1