Blog, Articles, and discussions

`LeRobotDataset`: Bringing large-scale datasets to lerobot

By September 16, 2025 • 6

Community Articles

view all

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

6 days ago

• 93

How to Choose the Best Open Source LLM for Your Project in 2025

•

7 days ago

• 62

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

5 days ago

• 43

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 215

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

about 1 hour ago

• 4

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

•

Jul 10, 2024

• 80

Making any LLM model "reasoning"

•

Mar 23

• 18

Hands-On with NatureLM-audio: A No-Code Demo

and 8 others •

12 days ago

• 9

Open Source Synthetic Data with MOSTLY AI

•

about 7 hours ago

• 3

🕳️ Attention Sinks in LLMs for endless fluency

•

Oct 9, 2023

• 21

Let's talk about LLM evaluation

•

May 23, 2024

• 188

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 674

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 98

Visible Watermarking with Gradio

By September 15, 2025 • 6

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

By September 11, 2025 • 131

Jupyter Agents: training LLMs to reason with notebooks

By September 10, 2025 • 32

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 83

Welcome EmbeddingGemma, Google's new efficient embedding model

By September 4, 2025 • 208

Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation

By September 2, 2025 • 57

Generate Images with Claude and Hugging Face

By August 19, 2025 • 34

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

By August 18, 2025 • 66

MCP for Research: How to Connect AI to Research Tools

By August 18, 2025 • 50

TextQuests: How Good are LLMs at Text-Based Video Games?

By August 12, 2025 guest • 33

Introducing AI Sheets: a tool to work with datasets using open AI models!

By August 8, 2025 • 95

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By August 8, 2025 • 66

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

By August 12, 2025 • 15

Vision Language Model Alignment in TRL ⚡️

By August 7, 2025 • 82

Community Articles

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

6 days ago

• 93

How to Choose the Best Open Source LLM for Your Project in 2025

•

7 days ago

• 62

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

5 days ago

• 43

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

5 days ago

• 11

Code a simple RAG from scratch

•

Oct 29, 2024

• 193

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 65

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 26

Fine-tune Any LLM from the Hugging Face Hub with Together AI

and 3 others •

6 days ago

• 6

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 135

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 87

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 215

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

about 1 hour ago

• 4

Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚

•

Jul 10, 2024

• 80

Making any LLM model "reasoning"

•

Mar 23

• 18

Hands-On with NatureLM-audio: A No-Code Demo

and 8 others •

12 days ago

• 9

Open Source Synthetic Data with MOSTLY AI

•

about 7 hours ago

• 3

🕳️ Attention Sinks in LLMs for endless fluency

•

Oct 9, 2023

• 21

Let's talk about LLM evaluation

•

May 23, 2024

• 188

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 674

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

May 7, 2024

• 98

View all