zzhhjjj (Haojun Zhao)

upvoted 3 articles 11 months ago

Article

Vision Language Models Explained

merve, edbeeching

•

Apr 11, 2024

• 535

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

+5

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 258

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 778

upvoted an article about 1 year ago

Article

Open R1: Update #3

open-r1

•

Mar 11, 2025

• 297

upvoted an article over 1 year ago

Article

Fixing Open LLM Leaderboard with Math-Verify

+2

hynky, alozowski, SaylorTwift, clefourrier

•

Feb 14, 2025

• 31

upvoted 2 papers over 1 year ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 261

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 380

upvoted an article over 1 year ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

+4

medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf

•

Sep 18, 2024

• 280

upvoted an article almost 2 years ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

+1

neuralink, lvwerra, thomwolf

•

Aug 14, 2024

• 76

Haojun Zhao

AI & ML interests

Organizations

Vision Language Models Explained

nanoVLM: The simplest repository to train your VLM in pure PyTorch

SmolLM3: smol, multilingual, long-context reasoner

Open R1: Update #3

Fixing Open LLM Leaderboard with Math-Verify

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Qwen2.5 Technical Report

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

A failed experiment: Infini-Attention, and why we should keep trying?

Haojun Zhao

AI & ML interests

Organizations

zzhhjjj's activity

Vision Language Models Explained

nanoVLM: The simplest repository to train your VLM in pure PyTorch

SmolLM3: smol, multilingual, long-context reasoner

Open R1: Update #3

Fixing Open LLM Leaderboard with Math-Verify

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

A failed experiment: Infini-Attention, and why we should keep trying?