Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection about 9 hours ago

liked a model about 9 hours ago

Qwen/Qwen3-Coder-Next-GGUF

updated a collection about 9 hours ago

upvoted a paper 2 days ago

Accelerating Scientific Research with Gemini: Case Studies and Common Techniques

Paper • 2602.03837 • Published 6 days ago • 4

upvoted an article 4 days ago

Community Evals: Because we're done trusting black-box leaderboards over the community

Article • 6 days ago • 51

upvoted a collection 6 days ago

Qwen3-Coder-Next

4 items • Updated 6 days ago • 70

upvoted a paper 11 days ago

Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability

Paper • 2601.18778 • Published 14 days ago • 40

upvoted a collection 12 days ago

Kimi K2.5

Moonshot's most powerful model • 2 items • Updated 7 days ago • 44

upvoted a paper 14 days ago

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Paper • 2601.11868 • Published 24 days ago • 32

upvoted a paper 15 days ago

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published 19 days ago • 60

upvoted 2 collections 15 days ago

Qwen3-TTS

7 items • Updated 19 days ago • 283

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 9 items • Updated 19 days ago • 207

upvoted a paper 16 days ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published 18 days ago • 84

upvoted 2 articles 17 days ago

VLM-OCR Recipes on GPU Infrastructure

Article • 25 days ago • 14

Open Responses: What you need to know

Article • 26 days ago • 105

upvoted a collection 17 days ago

GutenOCR

3 items • Updated 18 days ago • 6

upvoted a paper 17 days ago

GutenOCR: A Grounded Vision-Language Front-End for Documents

Paper • 2601.14490 • Published 20 days ago • 37

upvoted a paper 19 days ago

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Paper • 2503.04721 • Published Mar 6, 2025 • 2

upvoted a collection 19 days ago

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 5 days ago • 37

upvoted a paper 19 days ago

AIBrix: Towards Scalable, Cost-Effective Large Language Model Inference Infrastructure

Paper • 2504.03648 • Published Feb 22, 2025 • 1

upvoted an article 20 days ago

Introducing OptiMind, a research model designed for optimization

Article • 25 days ago • 34

upvoted a collection 22 days ago

FLUX.2

Our second generation of FLUX • 17 items • Updated 22 days ago • 123

upvoted a paper 24 days ago

AI-Researcher: Autonomous Scientific Innovation

Paper • 2505.18705 • Published May 24, 2025 • 1