Dhruv Diddi

ddiddi

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

rajpurkarlab/ReXGradient-160K

upvoted an article about 1 month ago

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

liked a model about 1 month ago

nvidia/Cosmos-Reason1-7B

Organizations

upvoted an article about 1 month ago

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

and 6 others • May 11 • 67

upvoted 4 collections about 1 month ago

upvoted a paper about 2 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 78

upvoted 2 papers 2 months ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 38

Real-World Gaps in AI Governance Research

Paper • 2505.00174 • Published Apr 30 • 11

upvoted a paper 3 months ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 106

upvoted an article 4 months ago

Article

Transformers.js v3: WebGPU support, new models & tasks, and more…

Oct 22, 2024 • 74

upvoted a collection 4 months ago

Gemma 3 Release

Collection

24 items • Updated May 30 • 398

upvoted 7 papers 4 months ago

LocAgent: Graph-Guided LLM Agents for Code Localization

Paper • 2503.09089 • Published Mar 12 • 13

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol

Paper • 2503.05860 • Published Mar 7 • 11

AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models

Paper • 2503.08417 • Published Mar 11 • 8

"Principal Components" Enable A New Language of Images

Paper • 2503.08685 • Published Mar 11 • 12

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published Mar 10 • 99

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 46

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 87

upvoted 2 papers 5 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 126
