Token Bottleneck: One Token to Remember Dynamics Paper β’ 2507.06543 β’ Published 23 days ago β’ 18
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated May 1 β’ 334k β’ 1.46k
Running 2.86k 2.86k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.1-70B-Instruct Text Generation β’ 71B β’ Updated Dec 15, 2024 β’ 505k β’ β’ 833
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation Paper β’ 2410.23090 β’ Published Oct 30, 2024 β’ 56