Gradient Checkpointing for HF Trainer

#23
by acon96 - opened

Wire up the existing gradient checkpointing logic so that it works with the transformers Trainer.

Looking forward to the merge of this PR!

It would be great if you could merge this

Microsoft org

Hello everyone!

We have an ongoing PR at https://github.com/huggingface/transformers/pull/28163 that will resolve this issue.

Regards,
Gustavo.

gugarosa changed pull request status to closed
