Commits · flax-community/dalle-mini

make checkpointing optional

f6c4cb2

valhalla commited on Oct 19, 2021

handle dtype for embeddings

29db327

valhalla commited on Oct 19, 2021

add gradient checkpointing

95a8ed2

valhalla commited on Oct 19, 2021

fix layernorm

7774483

valhalla commited on Oct 19, 2021

add partition helpers

2856356

valhalla commited on Oct 19, 2021

remove bias and minor fixes

180ed1e

valhalla commited on Oct 19, 2021

add standalone modeling file

6197b2f

valhalla commited on Oct 8, 2021

Merge pull request #27 from tmabraham/fix-forced-bos-token-on-demo

8f484d9
unverified

pcuenca commited on Jul 15, 2021

fix forced bos token, also applying BART model to 8 samples now

8d4e13c

tmabraham commited on Jul 15, 2021

Merge pull request #26 from tmabraham/generation-training-demo

fc8c230
unverified

boris commited on Jul 15, 2021

demo for generation, including during training from wandb artifact

c48da33

tmabraham commited on Jul 15, 2021

Merge pull request #23 from khalidsaifullaah/main

eb591ff
unverified

pcuenca commited on Jul 14, 2021

Merge pull request #24 from borisdayma/feat--log-model-frequently

648e404
unverified

boris commited on Jul 14, 2021

feat: use bart-large-cnn

19d68bb

boris commited on Jul 14, 2021

fix: log metadata

99a1ff5

boris commited on Jul 14, 2021

fix: define function before it is used

d449092

boris commited on Jul 14, 2021

fix: correct arg

283adc6

boris commited on Jul 14, 2021

feat: save model frequently

754f876

boris commited on Jul 14, 2021

feat: split script for small and big runs

5e244d0

boris commited on Jul 14, 2021

feat: update test script

3cccb01

boris commited on Jul 14, 2021

feat: bye bye tensorboard

533b494

boris commited on Jul 14, 2021

feat: use bart large

bb3bfa6

boris commited on Jul 14, 2021

YFCC metadata cleaning and encoding script

2c2f570

khalidsaifullaah commited on Jul 14, 2021

Merge pull request #22 from borisdayma/feat-axis

a1c047b
unverified

boris commited on Jul 14, 2021

fix: use correct key

b20769d

boris commited on Jul 14, 2021

fix: log correct metrics

3fef9c1

boris commited on Jul 14, 2021

feat: hardcode eval_steps

4c5e5a7

boris commited on Jul 14, 2021

fix: eval_steps belongs to training_args

900136f

boris commited on Jul 14, 2021

feat: eval_steps already exists in TrainingArguments

0a0080b

boris commited on Jul 14, 2021

Merge branch 'main'

3ddf1c5

boris commited on Jul 14, 2021

feat: set default x-axis

97a008e

boris commited on Jul 14, 2021

feat: log everything through wandb

19070ab

boris commited on Jul 14, 2021

Merge pull request #21 from borisdayma/feat-no_decay

b29bab7
unverified

boris commited on Jul 14, 2021

feat: eval less often for faster training

f0a53ac

boris commited on Jul 14, 2021

Merge pull request #20 from borisdayma/eval-interval

635402d
unverified

boris commited on Jul 14, 2021

feat: no decay option

5a3211f

boris commited on Jul 14, 2021

feat: use common wandb shared folder

7aa2f4b

boris commited on Jul 14, 2021

feat: change default for quick tests

71c757b

boris commited on Jul 14, 2021

feat: hardcoded datasets

e8709a6

boris commited on Jul 14, 2021

Merge pull request #19 from pcuenca/main

f8b0895
unverified

boris commited on Jul 14, 2021

Merge pull request #18 from borisdayma/change-bart-large-demo

395641f
unverified

boris commited on Jul 14, 2021

Add eval_interval to evaluate and log every so often.

566d5f2

Pedro Cuenca commited on Jul 14, 2021

Notebook to encode splitted YFCC100M files.

82fad8c

Pedro Cuenca commited on Jul 14, 2021

change bart-large-cnn to bart-large in demo folder

5801f13

Ritobrata Ghosh commited on Jul 14, 2021

Shift tokens in numpy because the built in shift function stalls.

835ea55

Pedro Cuenca commited on Jul 14, 2021

fix: should be converted to array

945d86c

boris commited on Jul 14, 2021

fix: labels array

6c1f112

boris commited on Jul 14, 2021

fix: typo

678a62f

boris commited on Jul 14, 2021

Merge pull request #17 from borisdayma/fix-model

357779a
unverified

boris commited on Jul 14, 2021

fix: model config

0be4942

boris commited on Jul 14, 2021

Commit History

make checkpointing optional f6c4cb2

handle dtype for embeddings 29db327

add gradient checkpointing 95a8ed2

fix layernorm 7774483

add partition helpers 2856356

remove bias and minor fixes 180ed1e

add standalone modeling file 6197b2f

Merge pull request #27 from tmabraham/fix-forced-bos-token-on-demo 8f484d9 unverified

fix forced bos token, also applying BART model to 8 samples now 8d4e13c

Merge pull request #26 from tmabraham/generation-training-demo fc8c230 unverified

demo for generation, including during training from wandb artifact c48da33

Merge pull request #23 from khalidsaifullaah/main eb591ff unverified

Merge pull request #24 from borisdayma/feat--log-model-frequently 648e404 unverified

feat: use bart-large-cnn 19d68bb

fix: log metadata 99a1ff5

fix: define function before it is used d449092

fix: correct arg 283adc6

feat: save model frequently 754f876

feat: split script for small and big runs 5e244d0

feat: update test script 3cccb01

feat: bye bye tensorboard 533b494

feat: use bart large bb3bfa6

YFCC metadata cleaning and encoding script 2c2f570

Merge pull request #22 from borisdayma/feat-axis a1c047b unverified

fix: use correct key b20769d

fix: log correct metrics 3fef9c1

feat: hardcode eval_steps 4c5e5a7

fix: eval_steps belongs to training_args 900136f

feat: eval_steps already exists in TrainingArguments 0a0080b

Merge branch 'main' 3ddf1c5

feat: set default x-axis 97a008e

feat: log everything through wandb 19070ab

Merge pull request #21 from borisdayma/feat-no_decay b29bab7 unverified

feat: eval less often for faster training f0a53ac

Merge pull request #20 from borisdayma/eval-interval 635402d unverified

feat: no decay option 5a3211f

feat: use common wandb shared folder 7aa2f4b

feat: change default for quick tests 71c757b

feat: hardcoded datasets e8709a6

Merge pull request #19 from pcuenca/main f8b0895 unverified

Merge pull request #18 from borisdayma/change-bart-large-demo 395641f unverified

Add eval_interval to evaluate and log every so often. 566d5f2

Notebook to encode splitted YFCC100M files. 82fad8c

change bart-large-cnn to bart-large in demo folder 5801f13

Shift tokens in numpy because the built in shift function stalls. 835ea55

fix: should be converted to array 945d86c

fix: labels array 6c1f112

fix: typo 678a62f

Merge pull request #17 from borisdayma/fix-model 357779a unverified

fix: model config 0be4942

make checkpointing optional

f6c4cb2

handle dtype for embeddings

29db327

add gradient checkpointing

95a8ed2

fix layernorm

7774483

add partition helpers

2856356

remove bias and minor fixes

180ed1e

add standalone modeling file

6197b2f

Merge pull request #27 from tmabraham/fix-forced-bos-token-on-demo

8f484d9
unverified

fix forced bos token, also applying BART model to 8 samples now

8d4e13c

Merge pull request #26 from tmabraham/generation-training-demo

fc8c230
unverified

demo for generation, including during training from wandb artifact

c48da33

Merge pull request #23 from khalidsaifullaah/main

eb591ff
unverified

Merge pull request #24 from borisdayma/feat--log-model-frequently

648e404
unverified

feat: use bart-large-cnn

19d68bb

fix: log metadata

99a1ff5

fix: define function before it is used

d449092

fix: correct arg

283adc6

feat: save model frequently

754f876

feat: split script for small and big runs

5e244d0

feat: update test script

3cccb01

feat: bye bye tensorboard

533b494

feat: use bart large

bb3bfa6

YFCC metadata cleaning and encoding script

2c2f570

Merge pull request #22 from borisdayma/feat-axis

a1c047b
unverified

fix: use correct key

b20769d

fix: log correct metrics

3fef9c1

feat: hardcode eval_steps

4c5e5a7

fix: eval_steps belongs to training_args

900136f

feat: eval_steps already exists in TrainingArguments

0a0080b

Merge branch 'main'

3ddf1c5

feat: set default x-axis

97a008e

feat: log everything through wandb

19070ab

Merge pull request #21 from borisdayma/feat-no_decay

b29bab7
unverified

feat: eval less often for faster training

f0a53ac

Merge pull request #20 from borisdayma/eval-interval

635402d
unverified

feat: no decay option

5a3211f

feat: use common wandb shared folder

7aa2f4b

feat: change default for quick tests

71c757b

feat: hardcoded datasets

e8709a6

Merge pull request #19 from pcuenca/main

f8b0895
unverified

Merge pull request #18 from borisdayma/change-bart-large-demo

395641f
unverified

Add eval_interval to evaluate and log every so often.

566d5f2

Notebook to encode splitted YFCC100M files.

82fad8c

change bart-large-cnn to bart-large in demo folder

5801f13

Shift tokens in numpy because the built in shift function stalls.

835ea55

fix: should be converted to array

945d86c

fix: labels array

6c1f112

fix: typo

678a62f

Merge pull request #17 from borisdayma/fix-model

357779a
unverified

fix: model config

0be4942