Efficient LLM Pretraining: Packed Sequences and Masked Attention
By sirluk • Oct 7, 2024