Loss was 1.668821 and still going down. A second epoch (or more data!) might help.

Uploaded model

This Mistral model was trained 2x faster with Unsloth and Hugging Face's TRL library.

Format: GGUF

Model size: 7.24B params

Architecture: llama

Hardware compatibility: 2-bit, 4-bit, 8-bit
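To give a rough sense of what the listed bit widths mean for hardware, here is a minimal back-of-the-envelope sketch. It assumes the weights dominate file size and that every parameter is stored at exactly the nominal bit width; real GGUF quantization schemes also store per-block scales and metadata, so actual files are somewhat larger. The helper name `approx_weight_gb` is hypothetical and only for illustration.

```python
def approx_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough weight storage in GB (10^9 bytes).

    Assumption: every parameter uses exactly `bits_per_weight` bits.
    Quantization scales, metadata, and runtime memory are ignored.
    """
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 7.24e9  # parameter count stated on the model card

# Estimate the weight footprint for each bit width listed on the card.
estimates = {bits: approx_weight_gb(N_PARAMS, bits) for bits in (2, 4, 8)}

for bits, gb in estimates.items():
    print(f"{bits}-bit: ~{gb:.2f} GB")
```

Under these assumptions the weights come to roughly 1.8 GB, 3.6 GB, and 7.2 GB respectively. Treat these as lower bounds: inference also needs memory for the context cache and runtime buffers.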


Model tree for paul-stansifer/qw-mistral-1e-3-7b-gguf: this model is one of 162 quantized versions of the base model.