Adrien Bufort PRO

Forbu14

AdrienB

AI & ML interests

Deep learning, machine learning, reinforcement learning. @orange

Recent Activity

liked a dataset about 7 hours ago

openclimatefix/era5

liked a dataset 1 day ago

Lucasdegeorge/ImageNet_TA_IA

upvoted a collection about 4 hours ago

The Well

A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24 • 8

upvoted an article 21 days ago

Welcome to Inference Providers on the Hub 🔥

Article • 7 authors • Published Jan 28 • 486

upvoted an article about 1 month ago

Transformers Are Getting Old: Variants and Alternatives Exist!

Article • Published Jul 5 • 42

upvoted a paper 3 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 37

upvoted 3 papers 4 months ago

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers

Paper • 2504.10483 • Published Apr 14 • 21

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 129

Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

Paper • 2403.03206 • Published Mar 5, 2024 • 69

upvoted an article 4 months ago

GSMA Open-Telco LLM Benchmarks

Article • 7 authors • Published Feb 25 • 27

upvoted an article 5 months ago

Open R1: Update #3

Article • 10 authors • Published Mar 11 • 295

upvoted a paper 5 months ago

How far can we go with ImageNet for Text-to-Image generation?

Paper • 2502.21318 • Published Feb 28 • 26

upvoted 2 papers 6 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 193

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 122

upvoted 2 papers 7 months ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published Jan 2 • 18

upvoted a collection 7 months ago

OLMo 2

Artifacts for the OLMo 2 release. • 35 items • Updated May 1 • 136

upvoted 2 collections 10 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 18 days ago • 155

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 18 days ago • 51

upvoted 2 papers over 1 year ago

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 62

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 152

upvoted a collection over 1 year ago

Models

41 items • Updated Nov 29, 2024 • 1