Running 2.4k 2.4k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper โข 2502.05171 โข Published Feb 7 โข 132
Byte Latent Transformer: Patches Scale Better Than Tokens Paper โข 2412.09871 โข Published Dec 13, 2024 โข 95
Running on CPU Upgrade 12.9k 12.9k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots