Update README.md
README.md
CHANGED
@@ -2,46 +2,23 @@
 pipeline_tag: text-generation
 ---
 
-Collection of resources and models for storytelling and roleplay. Updated
-
-**Current favorite**: [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501)
+Collection of resources and models for storytelling and roleplay. Updated Mar 2025.
 
 Some notes on best usage:
--
--
--
-- Conclusion: use original instruct models with short prompts
-- actually, on second thought: this idea that the original instruct models have "higher intelligence" might be flawed. The other perspective is that the models are merely a representation of their dataset, which is why you might prefer finetunes and monster merges.
-# ⚒️ Base models
-
-- Llama 3 (8B) - the OG
-- [Mistral-Nemo-Base-2407](https://huggingface.co/mistralai/Mistral-Nemo-Base-2407) (12B)
-- Qwen2.5
-- Mistral Small
-
-# 🤖 Instruct models
-
-- Llama 3 Instruct
-- Qwen 2.5 Instruct
-- [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) (12B)
-- **Mistral Small (22B)** (instruct) - a winner?
-
-# 😈 Story/roleplay-tuned models
-
-- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
-- [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
-- [nothingiisreal/MN-12B-Celeste-V1.9](https://huggingface.co/nothingiisreal/MN-12B-Celeste-V1.9)
-- [anthracite-org/magnum-12b-v2](https://huggingface.co/anthracite-org/magnum-12b-v2)
-- [NeverSleep/Lumimaid-v0.2-8B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-8B)
-- [nbeerbower/mistral-nemo-gutenberg-12B-v2](https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v2)
+- don't waste time on sampler settings; use the recommended ones and optimize the prompt instead
+- don't "overparameterize" by writing an overly long prompt
+- model size/intelligence matters, but models are mimics of their dataset, so the training data matters just as much
 
-#
+# model ranking
 
-
-
-
+1. [lars1234/Mistral-Small-24B-Instruct-2501-writer](https://huggingface.co/lars1234/Mistral-Small-24B-Instruct-2501-writer) - **wow** this one is great
+1. [Sao10K/MN-12B-Lyra-v4](https://huggingface.co/Sao10K/MN-12B-Lyra-v4)
+1. [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B)
+1. [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
+1. [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
+1. [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) (12B)
 
-#
+# misc links
 
 - [llama.cpp](https://github.com/ggerganov/llama.cpp) and [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - **preferred LLM software**
 - [/r/localllama](https://www.reddit.com/r/LocalLLaMA/)
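As a rough illustration of the updated usage notes (stock sampler settings, short prompt) with the README's preferred software, here is a minimal llama-cpp-python sketch. The GGUF filename, context size, and prompt text are placeholder assumptions for illustration, not something the README specifies:

```python
# Minimal sketch: keep the prompt short and lean on default sampler settings,
# per the README's usage notes. Model path and prompts are placeholders.
from llama_cpp import Llama

llm = Llama(
    model_path="./Mistral-Small-24B-Instruct-2501-writer-Q4_K_M.gguf",  # hypothetical local GGUF
    n_ctx=8192,      # room for a scene without encouraging an overlong prompt
    verbose=False,
)

# A short system prompt instead of an "overparameterized" one.
messages = [
    {"role": "system", "content": "You are a vivid, concise storyteller. Stay in character."},
    {"role": "user", "content": "Continue the scene: the smuggler's skiff drifts into the fog."},
]

# Sampler parameters are left at the library defaults rather than hand-tuned.
out = llm.create_chat_completion(messages=messages, max_tokens=400)
print(out["choices"][0]["message"]["content"])
```

Leaving the sampling arguments alone mirrors the "use the recommended ones" note; the prompt text is the part worth iterating on.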