Blog, Articles, and discussions

SOTA OCR on-device with Core ML and dots.ocr

By October 2, 2025 • 27

Community Articles

ModernVBERT: Towards Smaller Visual Document Retrievers

and 4 others •

5 days ago

• 35

Visualizing How VLMs Work

and 1 other •

about 23 hours ago

• 15

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

7 days ago

• 14

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

8 days ago

• 16

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 229

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

14 days ago

• 25

The Past and Present of Sparse Retrieval

•

4 days ago

• 4

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 101

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 47

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 36

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By June 3, 2025 • 262

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

By June 3, 2025 guest • 91

CodeAgents + Structure: A Better Way to Execute Actions

By May 28, 2025 • 76

🐯 Liger GRPO meets TRL

By May 25, 2025 guest • 51

Dell Enterprise Hub is all you need to build AI on premises

By May 23, 2025 • 20

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

By May 23, 2025 • 164

Exploring Quantization Backends in Diffusers

By May 21, 2025 • 43

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By May 21, 2025 • 220

Microsoft and Hugging Face expand collaboration

By May 19, 2025 • 26

The Transformers Library: standardizing model definitions

By May 15, 2025 • 118

Improving Hugging Face Model Access for Kaggle Users

By May 14, 2025 • 33

Blazingly fast whisper transcriptions with Inference Endpoints

By May 13, 2025 • 77

Vision Language Models (Better, Faster, Stronger)

By May 12, 2025 • 538

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By May 11, 2025 • 79

Community Articles

Code a simple RAG from scratch

•

Oct 29, 2024

• 214

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 83

There is no such thing as a tokenizer-free lunch

•

13 days ago

• 73

Gaia2 Leaderboard Update: New Models and New Observations

and 3 others •

6 days ago

• 7

Cactus: High-Performance AI Inference on Any Smartphone

•

5 days ago

• 6

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 143

How to Train an Antibody Developability Model

and 1 other •

21 days ago

• 16

Model Quality: Hugging Face Is All You Need

•

12 days ago

• 21

Vocabulary is the most important element of Sparse Retrieval

•

4 days ago

• 5

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 688
