Text Generation
scaling
umup-research-3b-fp8 / model_state_layer_26_TransformerLMHead.pt

Commit History