view article Article Measuring Open-Source Llama Nemotron Models on DeepResearch Bench By nvidia • Aug 4 • 5
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 3 items • Updated 3 days ago • 122
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 337