HugoLaurencon (Hugo Laurençon)

upvoted a paper 25 days ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published 26 days ago • 39

upvoted 2 papers about 1 month ago

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30 • 64

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

Paper • 2505.19075 • Published May 25 • 21

upvoted a paper about 2 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 95

upvoted 3 papers 3 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 131

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 74

upvoted a collection 3 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29 • 568

upvoted a paper 4 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 133

upvoted an article 4 months ago

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.27k

upvoted 2 papers 4 months ago

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 39

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 23

upvoted a paper 5 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

upvoted an article 5 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

By

and 3 others •

Feb 4

• 167

upvoted 4 papers 6 months ago

upvoted 2 papers 7 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 369

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 132

Hugo Laurençon

AI & ML interests

Organizations

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Llama 4

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Open-source DeepResearch – Freeing our search agents

Nougat: Neural Optical Understanding for Academic Documents

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Autonomy-of-Experts Models

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Tensor Product Attention Is All You Need

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Qwen2.5 Technical Report

Building and better understanding vision-language models: insights and future directions

Hugo Laurençon

AI & ML interests

Organizations

HugoLaurencon's activity

Open-source DeepResearch – Freeing our search agents

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control