Update README.md

README.md CHANGED

```diff
@@ -32,7 +32,7 @@ This model was built via parameter-efficient finetuning of the [mistralai/Mistra

 ## Model Sources

-- **Repository:** [github.com/daniel-furman/sft-demos](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/
+- **Repository:** [github.com/daniel-furman/sft-demos](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/mistral/sft_Mistral_7B_Instruct_v0_1_peft.ipynb)

 ## Evaluation Results

@@ -209,6 +209,8 @@ print(tokenizer.decode(input_ids[0]))

 We use the [SFTTrainer](https://huggingface.co/docs/trl/main/en/sft_trainer) from `trl` to fine-tune LLMs on instruction-following datasets.

+See [here](https://github.com/daniel-furman/sft-demos/blob/main/src/sft/mistral/sft_Mistral_7B_Instruct_v0_1_peft.ipynb) for the finetuning code, which contains an exhaustive view of the hyperparameters employed.
+
 The following `TrainingArguments` config was used:

 - num_train_epochs = 1
```
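The `TrainingArguments` config mentioned in the second hunk can be sketched as a plain dict; this is illustrative only. The sole value taken from the diff is `num_train_epochs = 1`; the output path and every other detail are assumed placeholders, not the notebook's actual settings.

```python
# Sketch of the hyperparameter config described above. Only
# num_train_epochs = 1 appears in the diff; the output_dir value is an
# assumed placeholder, not the notebook's actual setting.
training_kwargs = {
    "output_dir": "sft-mistral-7b-instruct",  # hypothetical output path
    "num_train_epochs": 1,                    # the one value the README lists
}

# In the notebook, a dict like this would be expanded into
# transformers.TrainingArguments(**training_kwargs) and handed to trl's
# SFTTrainer together with the base model, the instruction dataset, and a
# peft LoraConfig for the parameter-efficient finetuning step.
```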