small-rp-models / README.md
ewe666's picture
Update README.md
bf4eb61 verified
|
raw
history blame
1.88 kB
---
pipeline_tag: text-generation
---
Good story telling models that can fit in an RTX 3060 12GB. Updated July 2025.
Some notes on best usage:
- dont underestimate the original instruct models, esp from mistral
- dont waste time on sampler settings; use recommended and optimize the prompt
- don't "overparameterize" by writing too long a prompt
- model size/intelligence is important but they are just mimics, the dataset is very important
# model ranking
- **Winner**: [mistralai/Mistral-Small-3.1-24B-Instruct-2503](https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503)
- [Sao10K/MN-12B-Lyra-v4](https://huggingface.co/Sao10K/MN-12B-Lyra-v4)
- [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B)
- [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
- [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) (12B)
- [PocketDoc/Dans-PersonalityEngine-V1.2.0-24b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.2.0-24b)
# blacklist
for spammers, degens, and knownothings
- David AU
- Sicarius
# grey list
- The drummer
# misc links
- [llama.cpp](https://github.com/ggerganov/llama.cpp) and [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - **preferred LLM software**
- [/r/localllama](https://www.reddit.com/r/LocalLLaMA/)
- [/lmg/](https://boards.4chan.org/search#/lmg/g)
- [LMSys Chatbot Arena Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)
- [Uncensored General Intelligence Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard)
- [/r/SillyTavernAI](https://www.reddit.com/r/SillyTavernAI/)
- NothingiisReal discord
- NeverSleep discord
- SillyTavern discord
- BeaverAI discord