Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blog, Articles, and discussions
New Article
community
guide
open source collab
partnerships
research
NLP
Audio
CV
RL
ethics
Diffusion
Game Development
RLHF
Leaderboard
Case Studies
LeRobot
Inference Providers
Community Articles
view all
We’re open-sourcing our text-to-image model and the process behind it
7 days ago
•
63
Text-to-image Architectural Experiments
6 days ago
•
29
Projected Abliteration
25 days ago
•
25
AI Model Optimization More Flexible Than Ever
2 days ago
•
12
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
4 days ago
•
11
Introducing Cogito v2.1
about 5 hours ago
•
11
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
17 days ago
•
42
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
15 days ago
•
48
Uncensor any LLM with abliteration
Jun 13, 2024
•
721
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
174
Norm-Preserving Biprojected Abliteration
13 days ago
•
13
Granite 4.0 Nano: Just how small can you go?
22 days ago
•
118
🌳 QAT: The Art of Growing a Bonsai Model
10 days ago
•
15
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
1 day ago
•
7
Visualizing How VLMs Work
Oct 7
•
45
Why Did MiniMax M2 End Up as a Full Attention Model?
21 days ago
•
63
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
about 9 hours ago
•
6
Join the AMD Open Robotics Hackathon
6 days ago
•
6
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
about 17 hours ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
3 days ago
•
6
guide
privacy
research
Running Privacy-Preserving Inferences on Hugging Face Endpoints
18
April 16, 2024
vision
vlm
multimodal
Vision Language Models Explained
492
April 11, 2024
guide
text2sql
datasets
Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B
29
April 4, 2024
guide
community
Total noob’s intro to Hugging Face Transformers
96
March 22, 2024
nlp
community
guide
Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval
104
March 22, 2024
guide
nlp
synthetic-data
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models
104
March 20, 2024
guide
quantization
transformers
Quanto: a PyTorch quantization backend for Optimum
45
March 18, 2024
ethics
research
nlp
AI Watermarking 101: Tools and Techniques
+5
27
February 26, 2024
leaderboard
guide
collaboration
Introducing the Red-Teaming Resistance Leaderboard
13
February 23, 2024
nlp
community
guide
🪆 Introduction to Matryoshka Embedding Models
178
February 23, 2024
leaderboard
guide
collaboration
Introducing the Open Ko-LLM Leaderboard: Leading the Korean LLM Evaluation Ecosystem
5
February 20, 2024
guide
llm
nlp
🤗 PEFT welcomes new merging methods
24
February 19, 2024
guide
llm
nlp
Synthetic data: save money, time and carbon with open source
82
February 16, 2024
guide
llm
nlp
From OpenAI to Open LLMs with Messages API on Hugging Face
20
February 8, 2024
Previous
1
2
3
4
5
6
...
16
Next
Community Articles
Sort: Trending
We’re open-sourcing our text-to-image model and the process behind it
7 days ago
•
63
Text-to-image Architectural Experiments
6 days ago
•
29
Projected Abliteration
25 days ago
•
25
AI Model Optimization More Flexible Than Ever
2 days ago
•
12
The Heterogeneous Feature of RoPE-based Attention in Long-Context LLMs
4 days ago
•
11
Introducing Cogito v2.1
about 5 hours ago
•
11
The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix
17 days ago
•
42
ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases
15 days ago
•
48
Uncensor any LLM with abliteration
Jun 13, 2024
•
721
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30
•
174
Norm-Preserving Biprojected Abliteration
13 days ago
•
13
Granite 4.0 Nano: Just how small can you go?
22 days ago
•
118
🌳 QAT: The Art of Growing a Bonsai Model
10 days ago
•
15
The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling
1 day ago
•
7
Visualizing How VLMs Work
Oct 7
•
45
Why Did MiniMax M2 End Up as a Full Attention Model?
21 days ago
•
63
🧠 SQaLe: Enabling new Text-to-SQL models with our massive dataset
about 9 hours ago
•
6
Join the AMD Open Robotics Hackathon
6 days ago
•
6
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
about 17 hours ago
•
6
To Think or Not to Think: A Router for Hybrid LLMs
3 days ago
•
6
View all articles