Update README.md
README.md CHANGED
@@ -26,11 +26,11 @@ license: apache-2.0
 - **16k Sequence Length** This model was trained with a 16k sequence length.
 
 ### Experimental Nature
-Please note that Mistral-22b is still in a WIP.
+Please note that Mistral-22b is still a work in progress (WIP). v0.3 has started training now, using a different method than before, in the hope of making the model more well-rounded in its internal knowledge. Through my testing, I found v0.2 to be a significant improvement over v0.1.
 
 ### Upcoming Release: V.3
--
-- I have done some preliminary results with my different base model, and it appears to achieve a lower loss after the first epoch compared to the other base model used for v0.1 and v0.2. so we have started training v0.3 with the new base model and with the longer dataset.
+- v0.3 will feature a different base model for testing purposes; however, this model is pretty darn good for a second test. :)
+- I have run some preliminary tests with my new base model, and it achieves a lower loss after the first epoch than the base model used for v0.1 and v0.2, so we have started training v0.3 with the new base model and the longer dataset; it will be done and released within the next 48 hours.
 
 ### Stay Updated