Text Generation
Transformers
PyTorch
Safetensors
English
hf_olmo
custom_code

Gradient Checkpointing

#5
by amadalincostea2 - opened

Does the model support gradient checkpointing?

Ai2 org

The OLMo codebase supports activation checkpointing.

But since you're here in Huggingface, and not on GitHub, you probably want to know whether the Huggingface version of OLMo supports it?

Same person, different account. Yes I meant for the Huggingface version.

Ai2 org

@akshitab , do we have to do anything special to make activation checkpointing work?

dirkgr changed discussion status to closed

Sign up or log in to comment