Model : https://huggingface.co/GeneZC/MiniChat-1.5-3B
FT : @teilomillet
Instruction tune using QLoRA on a french dataset for 1 epoch. The aim was to test and try the dataset. Implementing a customization via a dataset and fine-tuning on it. The way to respond is also important to see if it's taken from the dataset and add to the customization.

This is the first of a long serie of multiple models. Aimed to be minuscule as possible.

Batch : 6
Gradient step : 1
Epoch : 1
Lr : 0.0002

Downloads last month: 1,050

Safetensors

Model size

3.02B params

Tensor type

BF16

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for teilomillet/MiniMerlin-3b-v0.1

Quantizations

1 model

Collection including teilomillet/MiniMerlin-3b-v0.1

Merlin

Collection

4 items • Updated Jan 6, 2024