Gaetan Lopez

gaetanlop

gaetanlop

AI & ML interests

None yet

Recent Activity

upvoted an article 29 days ago

Gotchas in Tokenizer Behavior Every Developer Should Know

upvoted an article 30 days ago

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

upvoted an article about 1 month ago

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

View all activity

Organizations

None yet

gaetanlop's activity

upvoted an article 29 days ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

•

Apr 18

• 36

upvoted an article 30 days ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

and 5 others •

Aug 21, 2024

• 35

upvoted an article about 1 month ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 264

upvoted 2 articles 2 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 291

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

and 3 others •

Mar 10

• 142

upvoted 3 articles 3 months ago

Article

Process Reinforcement through Implicit Rewards

and 1 other •

Jan 3

• 27

Article

SmolLM - blazingly fast and remarkably powerful

and 2 others •

Jul 16, 2024

• 373

Article

1 Billion Classifications

•

Feb 13

• 43

upvoted 3 articles 4 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 860

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

Jan 20

• 67

upvoted a paper 4 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 98

upvoted an article 7 months ago

Article

A Complete Guide to Audio Datasets

•

Dec 15, 2022

• 34

upvoted a paper 7 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 33

upvoted a paper 8 months ago

The Perfect Blend: Redefining RLHF with Mixture of Judges

Paper • 2409.20370 • Published Sep 30, 2024 • 5

upvoted an article 11 months ago

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

and 2 others •

Mar 20, 2024

• 91