LegrandNico
/

Llama-3.2-1B-Instruct-GRPO

text-generation-inference

Model card Files Files and versions Community

Llama-3.2-1B-Instruct-GRPO

Ctrl+K

Ctrl+K

1 contributor

History: 4 commits

LegrandNico's picture

Upload model trained with Unsloth

da13519 verified about 1 month ago

.gitattributes

1.57 kB

Upload model trained with Unsloth about 1 month ago
README.md

616 Bytes

Upload README.md with huggingface_hub about 1 month ago
adapter_config.json

876 Bytes

Upload model trained with Unsloth about 1 month ago
adapter_model.safetensors

90.2 MB
LFS

Upload model trained with Unsloth about 1 month ago
chat_template.jinja

3.83 kB

Upload model trained with Unsloth about 1 month ago
special_tokens_map.json

454 Bytes

Upload model trained with Unsloth about 1 month ago
tokenizer.json

17.2 MB
LFS

Upload model trained with Unsloth about 1 month ago
tokenizer_config.json

50.6 kB

Upload model trained with Unsloth about 1 month ago