Update README.md
README.md CHANGED
@@ -7,8 +7,8 @@ license: apache-2.0
 This is an experimental version of LimaRP for [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) with
 about 1800 training samples _up to_ 4k tokens length. Contrarily to the previously released "v3" version for Llama-2, this one does
 not include a preliminary finetuning pass on several thousands short stories. Initial testing has shown Mistral to be capable of
-generating on its own the kind of stories that were included there; its training data appears to be quite diverse and not
-been filtered
+generating on its own the kind of stories that were included there; its training data appears to be quite diverse and does not
+seem to have been filtered for content type.
 
 Due to software limitations, finetuning didn't take advantage yet of the Sliding Window Attention (SWA) which would have allowed
 to use longer conversations in the training data. Thus, this version of LimaRP should be considered an initial attempt and