Triangle104 committed · verified
Commit fc20bff · 1 parent: 14c2c94

Update README.md

Files changed (1): README.md (+41, -0)
README.md CHANGED
@@ -110,6 +110,47 @@ model-index:
  This model was converted to GGUF format from [`anthracite-org/magnum-v4-12b`](https://huggingface.co/anthracite-org/magnum-v4-12b) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/anthracite-org/magnum-v4-12b) for more details on the model.

+ ---
+ Model details:
+ -
+ This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet and Opus.
+
+ This model is fine-tuned on top of mistralai/Mistral-Nemo-Instruct-2407.
+
+ Prompting
+ -
+ A typical input would look like this:
+
+ <s>[INST] SYSTEM MESSAGE
+ USER MESSAGE[/INST] ASSISTANT MESSAGE</s>[INST] USER MESSAGE[/INST]
+
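As a rough, non-authoritative sketch (the GGUF filename and both messages are placeholders, not part of the original card), a single-turn prompt in this format could be passed straight to llama.cpp's llama-cli, which the "Use with llama.cpp" section further down installs:

```bash
# Sketch of a single-turn prompt in the [INST] format shown above.
# Assumptions: the quantized filename is a placeholder, and llama-cli is
# expected to prepend the <s> (BOS) token itself, so it is omitted here.
llama-cli -m magnum-v4-12b.Q4_K_M.gguf \
  -p "[INST] You are a creative writing assistant.
Write two sentences about a lighthouse at dusk.[/INST]" \
  -n 256
```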
+ SillyTavern templates
+ -
+ Below are Instruct and Context templates for use within SillyTavern.
+
+ context template
+
+ instruct template
+
+ Credits
+ -
+ We'd like to thank Recursal / Featherless for sponsoring the compute for this training run. Featherless has been hosting our Magnum models since the first 72B release and has given thousands of people access to our models, helping us grow.
+
+ We would also like to thank all members of Anthracite who made this finetune possible.
+
+ Datasets
+ -
+ anthracite-org/c2_logs_32k_llama3_qwen2_v1.2_no_system
+ anthracite-org/kalo-opus-instruct-22k-no-refusal-no-system
+ anthracite-org/kalo-opus-instruct-3k-filtered-no-system
+ anthracite-org/nopm_claude_writing_fixed
+ anthracite-org/kalo_opus_misc_240827_no_system
+ anthracite-org/kalo_misc_part2_no_system
+
+ Training
+ -
+ The training was done for 2 epochs. We used 8x H100 GPUs graciously provided by Recursal AI / Featherless AI for the full-parameter fine-tuning of the model.
+
+ ---
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)
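For reference, a minimal sketch of that install step (assuming Homebrew, or Linuxbrew on Linux, is already set up):

```bash
# Installs llama.cpp (including the llama-cli binary) via Homebrew.
brew install llama.cpp
```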