Update README.md to reflect current gradient checkpointing support
Previously, the README stated that gradient checkpointing was incompatible with 4-bit LoRA in the current implementation; this is no longer the case. I have replaced the warning with a link to the Hugging Face documentation on gradient checkpointing.
README.md

```diff
@@ -387,7 +387,7 @@ train_on_inputs: false
 # don't use this, leads to wonky training (according to someone on the internet)
 group_by_length: false
 
-#
+# Whether to use gradient checkpointing https://huggingface.co/docs/transformers/v4.18.0/en/performance#gradient-checkpointing
 gradient_checkpointing: false
 
 # stop training after this many evaluation losses have increased in a row
```
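For context, gradient checkpointing recomputes activations during the backward pass instead of storing them, trading extra compute for lower peak memory. Below is a minimal config sketch showing it enabled alongside a 4-bit LoRA setup; this is an illustrative assumption, not an excerpt from the README, and key names other than `gradient_checkpointing`, `group_by_length`, and `train_on_inputs` (e.g. `load_in_4bit`, `adapter`, `lora_r`, `lora_alpha`) are assumed rather than taken from this repo.

```yaml
# Hypothetical config excerpt (keys below the comment are assumptions):
# enable gradient checkpointing together with a 4-bit LoRA adapter.
load_in_4bit: true       # quantize the base model to 4 bits (assumed key)
adapter: lora            # train LoRA adapters on the quantized model (assumed key)
lora_r: 8                # LoRA rank (assumed key)
lora_alpha: 16           # LoRA scaling factor (assumed key)

train_on_inputs: false
group_by_length: false

# Whether to use gradient checkpointing https://huggingface.co/docs/transformers/v4.18.0/en/performance#gradient-checkpointing
gradient_checkpointing: true   # recompute activations to reduce peak memory
```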