Update README.md
README.md CHANGED

@@ -4,6 +4,7 @@ datasets:
 - neph1/bellman-7b-finetune
 language:
 - sv
+library_name: peft
 ---
 
 Qlora trained for 5 epochs on 6400 rows of q&a from around 1000 pages from wikipedia + around 100 of python questions and examples from
@@ -11,6 +12,14 @@ eph1/Alpaca-Lora-GPT4-Swedish-Refined (because I had spent so much time cleaning
 gathered examples and some generated using chat-gpt.
 Dataset otherwise generated using gpt-3.5-turbo.
 
+Rank: 16
+
+Alpha: 16
+
+Dropout: 0.1
+
+Context length: 1024
+
 I may run another 5 epochs on this. But it feels like it's 'aligned' pretty well. (Regular mistral insists Magdalena Andersson is prime minister, still.)
 
 Example (q8):
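For reference, a minimal sketch of how the hyperparameters added in this change (Rank 16, Alpha 16, Dropout 0.1, context length 1024) could be expressed as a `peft` LoraConfig. This is not the author's training script; the task type, bias setting, and tokenization details are assumptions for illustration only.

```python
# Sketch only: maps the card's reported QLoRA hyperparameters onto a peft
# LoraConfig. Anything not stated in the card (task_type, bias) is an
# assumption, not the author's actual configuration.
from peft import LoraConfig

lora_config = LoraConfig(
    r=16,              # Rank: 16
    lora_alpha=16,     # Alpha: 16
    lora_dropout=0.1,  # Dropout: 0.1
    bias="none",       # assumed; not stated in the card
    task_type="CAUSAL_LM",
)

# The stated context length (1024) would be applied when tokenizing the
# training data, e.g. tokenizer(..., max_length=1024, truncation=True).
```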