File size: 1,448 Bytes
f589b8c d41ab71 f589b8c 5deeab5 d41ab71 f589b8c d41ab71 f589b8c 351b053 f589b8c d41ab71 b93cb21 d41ab71 f589b8c d41ab71 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 |
---
language:
- en
license: cc-by-nc-4.0
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
- sft
- Roleplay
- roleplay
base_model: SanjiWatsuki/Kunoichi-DPO-v2-7B
---
> [!Important]
> Still in experiment
# About this model
Remake [version 2](https://huggingface.co/Alsebay/NarumashiRTS-V2) with safetensor format, more safety and stable method, nothing change too much (base on the model hash). But to be real, in the previous version 2, I used unsafety method to save pretrain model, which could lead apply Lora layer twice to model, that make model have terrible performance. (Thanks Unsloth community told me about this :D )
- **Finetuned with rough translate dataset, to increase the accuracy in TSF theme, which is not quite popular. (lewd dataset)**
- **Finetuned from model :** SanjiWatsuki/Kunoichi-DPO-v2-7B . Thank SanjiWatsuki a lot :)
## GGUF version? [Here](https://huggingface.co/mradermacher/NarumashiRTS-7B-V2-1-GGUF). Thank you, mradermacher!
## V2 have more epochs.
## Dataset
```
Dataset(all are novels):
30% skinsuit
30% possession
35% transform(shapeshift)
5% other
```
# Thank Unsloth for good finetuning tool. This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth) |