Pierre Dulac

dulacp

Recent Activity

liked a model 5 days ago

deepseek-ai/DeepSeek-V3-0324

upvoted a paper 9 days ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

liked a model 19 days ago

microsoft/table-transformer-structure-recognition

dulacp's activity

upvoted a paper 9 days ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published 10 days ago • 82

upvoted a paper about 1 month ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18 • 126

upvoted 3 papers 3 months ago

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Paper • 2502.15657 • Published Feb 21 • 5

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 157

upvoted an article 3 months ago

Open R1: Update #2

Article • and 6 others • Feb 10 • 213

upvoted a collection 4 months ago

🤖 Agents

Collection • 21 items • Updated Dec 31, 2024 • 157

upvoted 2 articles 4 months ago

Introducing smolagents: simple agents that write actions in code.

Article • and 2 others • Dec 31, 2024 • 1.04k

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 860

upvoted a paper 4 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 391

upvoted 2 papers 5 months ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 84

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 42

upvoted a collection 5 months ago

PixMo

Collection • A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 24 days ago • 71

upvoted 3 papers 5 months ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 40

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 118

upvoted 4 papers 6 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 134

SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance

Paper • 2412.02687 • Published Dec 3, 2024 • 114

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 64

LLMs Do Not Think Step-by-step In Implicit Reasoning

Paper • 2411.15862 • Published Nov 24, 2024 • 10