Arthur EDMOND

Shumatsurontek

AI & ML interests

LLM & Computer Vision

Recent Activity

upvoted a paper 2 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 7 days ago • 41

liked a model 14 days ago

deepseek-ai/DeepSeek-R1-0528

Text Generation • Updated 27 days ago • 155k • 2.09k

commented on a paper 20 days ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published 20 days ago • 45

upvoted a paper 20 days ago

Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning

Paper • 2506.04207 • Published 20 days ago • 45

upvoted a paper 22 days ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 25 days ago • 129

upvoted a paper about 1 month ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 119

upvoted 2 papers 2 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 115

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 128

New activity in huggingface/InferenceSupport 2 months ago

OpenGVLab/InternVL3-78B

👍 13

#801 opened 2 months ago by

galvani4987

liked a model 2 months ago

HiDream-ai/HiDream-I1-Full

Text-to-Image • Updated 8 days ago • 152k • 901

upvoted a paper 2 months ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 76

liked 2 models 3 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • Updated 3 days ago • 63.1k • 414

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • Updated Apr 8 • 57.5k • 211

updated a model 3 months ago

Shumatsurontek/florence-2-large-ft-mod

Image-Text-to-Text • Updated Apr 9 • 73

published a model 3 months ago

Shumatsurontek/florence-2-large-ft-mod

Image-Text-to-Text • Updated Apr 9 • 73

updated a collection 3 months ago

VLMs

Collection

2 items • Updated Apr 9

liked 2 models 3 months ago

microsoft/Florence-2-large-ft

Image-Text-to-Text • Updated Jul 20, 2024 • 61.2k • 355

microsoft/Florence-2-base

Image-Text-to-Text • Updated Nov 4, 2024 • 601k • 277

liked a Space 3 months ago

Exam 1 - Fundamentals of GRPO

🔥

Test your knowledge of GRPO, TRL, RL, and Deepseek R1.

upvoted a paper 3 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 291
