A baseline model with extended training, trained without KeystoneFuse data-efficient pretraining.
Research supported with Cloud TPUs from Google's TPU Research Cloud (TRC).