Any Estimate on When 65B Will Come?

#10
by DrNicefellow - opened

Looking forward to it.

Ai2 org

Hopefully within the next month, but it depends on queue times. Lots of other groups are using the LUMI supercomputer right now as well, and the job time limit is 48 hours, so we can run for at most 48 hours at a time before sitting in the queue again.

@epwalsh Thank you very much for the information! Hope everything will go well!

shanearora changed discussion status to closed

@epwalsh BTW, it may be beneficial to the community if you release the intermediate checkpoints before the model is fully trained.

Just bumping this now that some time has passed: is the release of the 65B model still planned?