Yash Thube

thubZ9

AI & ML interests

Multimodal learning • CV • RL • Reasoning

Recent Activity

upvoted a paper about 2 hours ago

Unified Reward Model for Multimodal Understanding and Generation

upvoted a paper 2 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

updated a collection 5 days ago

My reading list!

View all activity

Organizations

thubZ9's activity

upvoted a paper about 2 hours ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 3 days ago • 51

upvoted a paper 2 days ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published 4 days ago • 66

upvoted a paper 6 days ago

Visual-RFT: Visual Reinforcement Fine-Tuning

Paper • 2503.01785 • Published 6 days ago • 59

upvoted a paper 12 days ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published 12 days ago • 67

upvoted a paper 13 days ago

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published 14 days ago • 51

upvoted a paper 14 days ago

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 17 days ago • 92

upvoted 2 papers 17 days ago

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Paper • 2502.14768 • Published 17 days ago • 44

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 17 days ago • 128

upvoted a paper 18 days ago

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 21 days ago • 50

upvoted a paper 19 days ago

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Paper • 2502.13143 • Published 19 days ago • 29

upvoted 2 papers 21 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published 23 days ago • 52

upvoted 2 papers 23 days ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data

Paper • 2502.08468 • Published 26 days ago • 13

upvoted a paper 29 days ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 58

upvoted 3 papers about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 199

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 82

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

upvoted 2 papers about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 341

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 92