|
--- |
|
pipeline_tag: text-generation |
|
--- |
|
|
|
Good story telling models that can fit in an RTX 3060 12GB. Updated July 2025. |
|
|
|
Some notes on best usage: |
|
- dont underestimate the original instruct models, esp from mistral |
|
- dont waste time on sampler settings; use recommended and optimize the prompt |
|
- don't "overparameterize" by writing too long a prompt |
|
- model size/intelligence is important but they are just mimics, the dataset is very important |
|
|
|
# model ranking |
|
|
|
- **Winner**: [mistralai/Mistral-Small-3.1-24B-Instruct-2503](https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503) |
|
- [Sao10K/MN-12B-Lyra-v4](https://huggingface.co/Sao10K/MN-12B-Lyra-v4) |
|
- [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B) |
|
- [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2) |
|
- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2) |
|
- [Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) (12B) |
|
- [PocketDoc/Dans-PersonalityEngine-V1.2.0-24b](https://huggingface.co/PocketDoc/Dans-PersonalityEngine-V1.2.0-24b) |
|
|
|
# blacklist |
|
for spammers, degens, and knownothings |
|
- David AU |
|
- Sicarius |
|
|
|
# grey list |
|
- The drummer |
|
|
|
# misc links |
|
|
|
- [llama.cpp](https://github.com/ggerganov/llama.cpp) and [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - **preferred LLM software** |
|
- [/r/localllama](https://www.reddit.com/r/LocalLLaMA/) |
|
- [/lmg/](https://boards.4chan.org/search#/lmg/g) |
|
- [LMSys Chatbot Arena Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard) |
|
- [Uncensored General Intelligence Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard) |
|
- [/r/SillyTavernAI](https://www.reddit.com/r/SillyTavernAI/) |
|
- NothingiisReal discord |
|
- NeverSleep discord |
|
- SillyTavern discord |
|
- BeaverAI discord |