SmolFactory / run_a100_large_experiment.py

Commit History

solves oom error with more reasonable configuration
d60ab6c
verified

Tonic commited on

fix large training script
829d8f4
verified

Tonic commited on

improves requirements and dependencies
0de9de2
verified

Tonic commited on

adds trackio support for large experiment
bb64084
verified

Tonic commited on

adds A100 large experiments
5fe83da
verified

Tonic commited on