Rei-KTO-24B

Model banner

Model Information

Rei-KTO-24B
KTO enhanced Painted Fantasy Finetune Creative Prose

A model meant to replicate the style and prose of the Anthropic Claude models, Opus and Sonnet. This model is meant for Roleplaying/Creative-writing, Has some nice smarts without being too sloppy, etc - It's pretty good. Trained in 2 steps, Firstly SFT trained on Zerofata's PaintedFantasy which i found great at anime-otaku-esque characters, and then KTO'd to improve coherency and Instruct Following

Quantized Versions

Available Downloads

  • GGUF FormatFor use with LLama.cpp & Forks (Ty Mradermacher <3)
  • EXL2 FormatFor use with TabbyAPI (Coming Soon!)

Prompting

The model is tuned with V7 Tekken formatting. A typical input would look like this:

[SYSTEM_PROMPT]system_prompt[/SYSTEM_PROMPT][INST]Hi there![/INST]Nice to meet you![INST]Can I ask a question?[/INST]

Training

Training was done in 2 steps, SFT>KTO

Access Configs
 SFT: https://wandb.ai/new-eden/Painted-Fantasy-Rei/artifacts/axolotl-config/config-u7to9d5q/v0/files/axolotl_config_f0p7vnaf.yml 
              KTO : https://wandb.ai/new-eden/Painted-Rei/artifacts/axolotl-config/config-8n37w77c/v0/files/axolotl_config_hvrd2tzn.yml
            

Training

The training was done for 2 epochs using 8 x A100s for 24 hours/p>

Credits

Thank you to Lucy Knada, Ateron, Alicat, Intervitens, Cgato, Kubernetes Bad and the rest of Anthracite.

Downloads last month
153
Safetensors
Model size
23.6B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Delta-Vector/Rei-24B-KTO