This model was converted to GGUF format from [`Gryphe/Pantheon-Proto-RP-1.8-30B-A3B`](https://huggingface.co/Gryphe/Pantheon-Proto-RP-1.8-30B-A3B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
Refer to the [original model card](https://huggingface.co/Gryphe/Pantheon-Proto-RP-1.8-30B-A3B) for more details on the model.
---

Ever since Qwen 3 was released I've been trying to get MoE finetuning to work. After countless frustrating days and much code hacking, I finally got a full finetune to complete with reasonable loss values.

I picked the base model for this since I didn't feel like fighting a reasoning model's training. Maybe someday I'll make a model which uses thinking tags for the character's thoughts or something.

This time the recipe focused on combining as many data sources as I possibly could, featuring synthetic data from Sonnet 3.5 + 3.7, ChatGPT-4o, and DeepSeek. These then went through an extensive rewriting pipeline to eliminate common AI clichés, with the hopeful intent of providing you with a fresh experience.

---

## Use with llama.cpp
Install llama.cpp through brew (works on Mac and Linux)
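
A minimal sketch of the usual GGUF-my-repo workflow. Note that `<repo-id>` and `<quant-file>` below are placeholders, not values from this model card; substitute the GGUF repository id and the quantization filename actually published here (e.g. a `Q4_K_M` file):

```shell
# Install llama.cpp via Homebrew (works on macOS and Linux)
brew install llama.cpp

# Run the model straight from the Hugging Face Hub.
# <repo-id> and <quant-file> are placeholders -- fill in the GGUF repo
# and the quant file you want to use.
llama-cli --hf-repo <repo-id> --hf-file <quant-file> \
  -p "The meaning to life and the universe is"
```

`llama-cli` downloads and caches the requested GGUF file on first use, so no separate download step is needed.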