Update README.md
README.md
CHANGED
@@ -29,9 +29,8 @@ license: apache-2.0
 Please note that Mistral-22b is still a WIP. v0.3 has started training with a different method than before, in the hope of making the model more well-rounded in its internal knowledge. In my testing, I found v0.2 to be a significant improvement over v0.1.

 ### Upcoming Release: v0.3
--
-- I have done some preliminary results with my
-
+- v0.3 will feature a different base model for testing purposes; however, this model is pretty darn good for a second test. :)
+- I have some preliminary results with my new v0.3 base model, and it appears to achieve a lower loss after the first epoch than the base model used for v0.1 and v0.2. So we have started training v0.3 with the new base model and the longer dataset; it will be done and released in the next 48 hours. :)

 ### Stay Updated
 **v0.3** is coming soon! It is currently training and will be done in the next ~24 hours. 🌟Paper Coming Soon🌟