Spaces:
Running
Running
Commit History
feat: load from bucket
1c4e839
feat: reduce artifact space + offset step
34cf91c
feat: restore weights on CPU
5f954fc
fix: position embedding for generate method
ebac379
fix: typo
68cc185
fix: load from checkpoint
44b7c3e
feat(modeling): simplify abstract_init
fa72aa7
feat(train) - handle multiple nodes (#130)
0952927
unverified
feat: handle model parallel
1bb3269
fix: style
386f839
style(tokenizer): remove unused variables
605df32
feat: use fast tokenizer
767d78a
feat(train): improve pjit speed
f254058
fix(train): consider correct batch size
b7c7458
feat(train): distributed_shampoo with pjit
cc34d07
style: unsused import
7a176b9
feat(model): clean way to load on cpu
12f323d
feat(train): no batch dimension with pjit
df1fe19
feat(train): progress on pjit
49597a2
feat: use_artifact if run existing
a5ed112
Load from wandb artifact (#121)
f69b21b
unverified
Style (isort).
f9d51f7
Pedro Cuenca
commited on
Tokenizer, config, model can be loaded from wandb.
7e48337
Pedro Cuenca
commited on
feat(data): support accumulation in non-streaming
88c8e06
feat: custom gradient accumulation
2d07559
Change import order again.
2b2be9b
Pedro Cuenca
commited on
Fix import order to make isort happy.
64d99b2
Pedro Cuenca
commited on
Accept changes suggested by linter.
9f522b8
Pedro Cuenca
commited on
Never consider local dirs as remote wandb references.
08dd098
Pedro Cuenca
commited on
Store resolved path after loading model.
55a631d
Pedro Cuenca
commited on
Override from_pretrained to support wandb artifacts.
1023afa
Pedro Cuenca
commited on