lemonilia
/

LimaRP-Mistral-7B-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

lemonilia commited on Sep 27, 2023

Commit

554bde2

•

1 Parent(s): 74e9cd7

Update README.md

Files changed (1) hide show

README.md +10 -11

README.md CHANGED Viewed

@@ -20,17 +20,6 @@ roleplaying chat model intended to replicate the experience of 1-on-1 roleplay o
 IRC/Discord-style RP (aka "Markdown format") is not supported yet. The model does not include instruction tuning,
 only manually picked and slightly edited RP conversations with persona and scenario data.
-## Important notes on generation settings
-It's recommended not to go overboard with low tail-free-sampling (TFS) values. From previous testing with Llama-2,
-decreasing it too much appeared to easily yield rather repetitive responses. Extensive testing with Mistral has not
-been performed yet, but suggested starting generation settings are:
-- TFS = 0.92~0.95
-- Temperature = 0.70~0.85
-- Repetition penalty = 1.05~1.10
-- top-k = 0 (disabled)
-- top-p = 1 (disabled)
 ## Prompt format
 Same as before. It uses the [extended Alpaca format](https://github.com/tatsu-lab/stanford_alpaca),
 with `### Input:` immediately preceding user inputs and `### Response:` immediately preceding
@@ -99,6 +88,16 @@ your desired response length:
 ![settings](https://files.catbox.moe/6lcz0u.png)
 ## Training procedure
 [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training
 on a 2x NVidia A40 GPU cluster.

 IRC/Discord-style RP (aka "Markdown format") is not supported yet. The model does not include instruction tuning,
 only manually picked and slightly edited RP conversations with persona and scenario data.
 ## Prompt format
 Same as before. It uses the [extended Alpaca format](https://github.com/tatsu-lab/stanford_alpaca),
 with `### Input:` immediately preceding user inputs and `### Response:` immediately preceding
 ![settings](https://files.catbox.moe/6lcz0u.png)
+## Text generation settings
+Extensive testing with Mistral has not been performed yet, but suggested starting text
+generation settings may be:
+- TFS = 0.92~0.95
+- Temperature = 0.70~0.85
+- Repetition penalty = 1.05~1.10
+- top-k = 0 (disabled)
+- top-p = 1 (disabled)
 ## Training procedure
 [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training
 on a 2x NVidia A40 GPU cluster.