lemonilia
/

LimaRP-Mistral-7B-v0.1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

lemonilia commited on Sep 27, 2023

Commit

7c97ac6

•

1 Parent(s): 4c6fa67

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -7,10 +7,11 @@ license: apache-2.0
 This is an experimental version of LimaRP for [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) with
 about 1800 training samples _up to_ 4k tokens length. Contrarily to the previously released "v3" version for Llama-2, this one does
 not include a preliminary finetuning pass on several thousands story. Initial testing has shown Mistral to be capable of
-generating on its own the kind of stories that were included there
 Due to software limitations, finetuning didn't take advantage yet of the Sliding Window Attention (SWA) which would have allowed
-to use longer conversations in the training data. Thus, this version of LimaRP could be considered an initial attempt.
 For more details about LimaRP, see the model page for the [previously released version for Llama-2](https://huggingface.co/lemonilia/limarp-llama2-v2).
 Most details written there apply for this version as well. Generally speaking, LimaRP is a longform-oriented, novel-style

 This is an experimental version of LimaRP for [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) with
 about 1800 training samples _up to_ 4k tokens length. Contrarily to the previously released "v3" version for Llama-2, this one does
 not include a preliminary finetuning pass on several thousands story. Initial testing has shown Mistral to be capable of
+generating on its own the kind of stories that were included there.
 Due to software limitations, finetuning didn't take advantage yet of the Sliding Window Attention (SWA) which would have allowed
+to use longer conversations in the training data. Thus, this version of LimaRP could be considered an initial attempt and
+will be updated in the future.
 For more details about LimaRP, see the model page for the [previously released version for Llama-2](https://huggingface.co/lemonilia/limarp-llama2-v2).
 Most details written there apply for this version as well. Generally speaking, LimaRP is a longform-oriented, novel-style