Accelerating Scientific Research with Gemini: Case Studies and Common Techniques Paper • 2602.03837 • Published 6 days ago • 4
Article Community Evals: Because we're done trusting black-box leaderboards over the community • 6 days ago • 51
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published 14 days ago • 40
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces Paper • 2601.11868 • Published 24 days ago • 32
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 19 days ago • 207
GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 20 days ago • 37
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities Paper • 2503.04721 • Published Mar 6, 2025 • 2
Nemotron Speech Collection Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 5 days ago • 37
AIBrix: Towards Scalable, Cost-Effective Large Language Model Inference Infrastructure Paper • 2504.03648 • Published Feb 22, 2025 • 1
Article Introducing OptiMind, a research model designed for optimization • 25 days ago • 34