Blog, Articles, and discussions

Community Articles

We’re open-sourcing our text-to-image model and the process behind it

Text-to-image Architectural Experiments

Projected Abliteration

AI Model Optimization More Flexible Than Ever

The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs

Introducing Cogito v2.1

about 5 hours ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases

Uncensor any LLM with abliteration

KV Caching Explained: Optimizing Transformer Inference Efficiency

Norm-Preserving Biprojected Abliteration

Granite 4.0 Nano: Just how small can you go?

🌳 QAT: The Art of Growing a Bonsai Model

The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling

Visualizing How VLMs Work

Why Did MiniMax M2 End Up as a Full Attention Model?

🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset

about 9 hours ago

Join the AMD Open Robotics Hackathon

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

about 17 hours ago

To Think or Not to Think: A Router for Hybrid LLMs

guide, privacy, research

Running Privacy-Preserving Inferences on Hugging Face Endpoints

vision, vlm, multimodal

Vision Language Models Explained

guide, text2sql, datasets

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Total noob’s intro to Hugging Face Transformers

nlp, community, guide

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

guide, nlp, synthetic-data

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

guide, quantization, transformers

Quanto: a PyTorch quantization backend for Optimum

ethics, research, nlp

AI Watermarking 101: Tools and Techniques

February 26, 2024

leaderboard, guide, collaboration

Introducing the Red-Teaming Resistance Leaderboard

February 23, 2024

nlp, community, guide

🪆 Introduction to Matryoshka Embedding Models

February 23, 2024

leaderboard, guide, collaboration

Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem

February 20, 2024

🤗 PEFT welcomes new merging methods

February 19, 2024

Synthetic data: save money, time and carbon with open source

February 16, 2024

From OpenAI to Open LLMs with Messages API on Hugging Face

February 8, 2024
