Blog, Articles, and discussions

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 92

Community Articles

view all

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

9 days ago

• 55

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

4 days ago

• 10

"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack

•

4 days ago

• 9

Code a simple RAG from scratch

•

Oct 29, 2024

• 198

How to Train an Antibody Developability Model

and 1 other •

3 days ago

• 7

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

3 days ago

• 7

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

1 day ago

• 7

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 68

Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel

and 2 others •

3 days ago

• 5

Finegrain Product Placement LoRA (experiment)

•

2 days ago

• 5

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 360

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 89

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 218

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 28

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 91

Diffusion Language Models: The New Paradigm

•

Jun 10

• 16

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

20 days ago

• 12

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 67

HuggingFace, IISc partner to supercharge model building on India's diverse languages

By February 27, 2025 • 23

Visual Document Retrieval Goes Multilingual

By January 10, 2025 guest • 75

Finally, a Replacement for BERT: Introducing ModernBERT

By December 19, 2024 guest • 687

Announcing New Hugging Face and KerasHub integration

By July 10, 2024 • 3

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

By April 3, 2024 guest • 11

Interactively explore your Huggingface dataset with one line of code

By October 25, 2023 • 1

Accelerating over 130,000 Hugging Face models with ONNX Runtime

By October 4, 2023 • 1

Deploying Hugging Face Models with BentoML: DeepFloyd IF in Action

By August 9, 2023 guest • 1

Happy 1st anniversary 🤗 Diffusers!

By July 20, 2023 • 2

Panel on Hugging Face

By June 22, 2023

Welcome fastText to the 🤗 Hub

By June 6, 2023 • 5

Introducing BERTopic Integration with Hugging Face Hub

By May 31, 2023 • 10

Creating Privacy Preserving AI with Substra

By April 12, 2023 • 2

Community Articles

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

and 1 other •

9 days ago

• 55

How to Choose the Best Open Source LLM for Your Project in 2025

•

11 days ago

• 68

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

10 days ago

• 95

mem-agent: Persistent, Human Readable Memory Agent Trained with Online RL

and 1 other •

9 days ago

• 16

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

4 days ago

• 10

"Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack

•

4 days ago

• 9

Code a simple RAG from scratch

•

Oct 29, 2024

• 198

How to Train an Antibody Developability Model

and 1 other •

3 days ago

• 7

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

3 days ago

• 7

Unleashing the Full Potential of ERNIE4.5 using FastDeploy

and 3 others •

1 day ago

• 7

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 68

Use AI on Your PC: Optimize and Deploy a Multimodal Agentic Pipeline on AI PC Powered by Intel

and 2 others •

3 days ago

• 5

Finegrain Product Placement LoRA (experiment)

•

2 days ago

• 5

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 360

Decoding Strategies in Large Language Models

•

Oct 29, 2024

• 89

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 218

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 28

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 5 others •

Jun 11

• 91

Diffusion Language Models: The New Paradigm

•

Jun 10

• 16

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

and 1 other •

20 days ago

• 12

View all