ZhimingMa

JimmyMa99

4 20 2

JimmyMa99

AI & ML interests

None yet

Organizations

upvoted a paper 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

upvoted a paper 5 months ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 355

upvoted a changelog 5 months ago

Hugging Face Changelog

HuggingChat for Papers

Jan 7

• 105

upvoted a paper 8 months ago

HI-TransPA: Hearing Impairments Translation Personal Assistant

Paper • 2511.09915 • Published Nov 13, 2025 • 7

upvoted 2 papers 9 months ago

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Paper • 2510.01954 • Published Oct 2, 2025 • 14

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 62

upvoted a paper 10 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 274

upvoted a paper 11 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 141

upvoted an article 11 months ago

Article

Mahjong: Where Grandmas Beat The Best LLMs

sileixu

•

Feb 18, 2025

• 9

upvoted 2 papers 12 months ago

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper • 2507.04404 • Published Jul 6, 2025 • 22

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

Paper • 2507.05675 • Published Jul 8, 2025 • 27

upvoted 9 papers over 1 year ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30, 2025 • 141

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31, 2025 • 29

TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

Paper • 2503.24115 • Published Mar 31, 2025 • 11

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Mar 19, 2025 • 62

Language Models as Continuous Self-Evolving Data Engineers

Paper • 2412.15151 • Published Dec 19, 2024 • 2

SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation

Paper • 2502.08168 • Published Feb 12, 2025 • 12

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10, 2025 • 58

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published Dec 12, 2024 • 39

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published Oct 21, 2024 • 61

ZhimingMa

AI & ML interests

Organizations

JimmyMa99's activity

HuggingChat for Papers

Mahjong: Where Grandmas Beat The Best LLMs