AI & ML interests

Collection of JS libraries to interact with the Hugging Face Hub

Recent Activity

huggingfacejs's activity

merve posted an update about 5 hours ago
merve posted an update 1 day ago
New GUI model by Salesforce AI & Uni HK: Jedi
tianbaoxiexxx/Jedi xlangai/Jedi-7B-1080p 🤗
Based on Qwen2.5-VL with Apache 2.0 license

prompt with the screenshot below → select "find more"
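a minimal transformers sketch of that flow (assuming Jedi keeps the standard Qwen2.5-VL chat interface; the screenshot path and instruction are placeholders):

```python
from PIL import Image
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration

# Jedi is Qwen2.5-VL-based, so the standard Qwen2.5-VL classes should load it
model_id = "xlangai/Jedi-7B-1080p"
model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# placeholder screenshot + instruction, mirroring the "find more" example above
image = Image.open("screenshot.png")
messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "Select the 'find more' button."},
    ]}
]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=[prompt], images=[image], return_tensors="pt").to(model.device)

out = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(out[:, inputs["input_ids"].shape[1]:], skip_special_tokens=True)[0])
```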
merve posted an update 3 days ago
HOT: MiMo-VL, new 7B vision LMs by Xiaomi surpassing GPT-4o (March), competitive in GUI agentic + reasoning tasks ❤️‍🔥 XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

not only that, but it also has an MIT license & is usable with transformers 🔥
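a rough sketch of loading it through the image-text-to-text pipeline (the exact checkpoint id from the collection is an assumption here):

```python
from transformers import pipeline

# checkpoint id below is an assumption; pick the exact one from the collection
pipe = pipeline(
    "image-text-to-text",
    model="XiaomiMiMo/MiMo-VL-7B-RL",
    device_map="auto",
    torch_dtype="auto",
)

messages = [
    {"role": "user", "content": [
        # any image URL or local path works here
        {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/bee.jpg"},
        {"type": "text", "text": "Describe this image and reason about what is happening."},
    ]}
]

out = pipe(text=messages, max_new_tokens=256, return_full_text=False)
print(out[0]["generated_text"])
```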
merve posted an update 4 days ago
introducing: VLM vibe eval 🪭 visionLMsftw/VLMVibeEval

vision LMs have saturated existing benchmarks, so we built a vibe eval 💬

> compare different models on refreshed in-the-wild examples across different categories 🤠
> submit your favorite model for eval
no numbers -- just vibes!
merve posted an update 6 days ago
emerging trend: models that can understand image + text and generate image + text

don't miss out ⤵️
> MMaDA: single 8B diffusion model aligned with CoT (reasoning!) + UniGRPO Gen-Verse/MMaDA
> BAGEL: 7B MoT model based on Qwen2.5, SigLIP-so-400M, Flux VAE ByteDance-Seed/BAGEL
both by ByteDance! 😱

I keep track of all any-input → any-output models here: https://huggingface.co/collections/merve/any-to-any-models-6822042ee8eb7fb5e38f9b62
merve posted an update 7 days ago
what happened in open-source AI this past week? so many vision LM & omni releases 🔥 merve/releases-23-may-68343cb970bbc359f9b5fb05

multimodal 💬🖼️
> new moondream (VLM) is out: it's a 4-bit quantized (QAT) version of moondream-2b, runs in 2.5GB VRAM at 184 tok/s with only a 0.6% drop in accuracy (OS) 🌚
> ByteDance released BAGEL-7B, an omni model that understands and generates both image + text. they also released Dolphin, a document parsing VLM 🐬 (OS)
> Google DeepMind dropped MedGemma at I/O, a VLM that can interpret medical scans, and Gemma 3n, an omni model with competitive LLM performance

> MMaDA is a new 8B diffusion language model that can generate image and text

LLMs
> Mistral released Devstral, a 24B coding assistant (OS) 👩🏻‍💻
> Fairy R1-32B is a new reasoning model -- distilled version of DeepSeek-R1-Distill-Qwen-32B (OS)
> NVIDIA released ACEReason-Nemotron-14B, new 14B math and code reasoning model
> sarvam-m is a new Indic LM with hybrid thinking mode, based on Mistral Small (OS)
> samhitika-0.0.1 is a new Sanskrit corpus (BookCorpus translated with Gemma3-27B)

image generation 🎨
> MTVCrafter is a new human motion animation generator
merve posted an update 11 days ago
Google released MedGemma at I/O '25 👏 google/medgemma-release-680aade845f90bec6a3f60c4

> 4B and 27B instruction fine-tuned vision LMs and a 4B pre-trained vision LM for medicine
> available with transformers from the get-go 🤗

they also released a cool demo for scan reading ➡️ google/rad_explain

use with transformers ⤵️
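a minimal sketch of that transformers usage (the 4B instruction-tuned checkpoint id and the scan image are assumptions):

```python
import torch
from PIL import Image
from transformers import AutoModelForImageTextToText, AutoProcessor

# 4B instruction-tuned checkpoint from the MedGemma collection (id assumed)
model_id = "google/medgemma-4b-it"
model = AutoModelForImageTextToText.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

# placeholder image path; the scan goes in as a regular image
messages = [
    {"role": "user", "content": [
        {"type": "image", "image": Image.open("chest_xray.png")},
        {"type": "text", "text": "Describe the findings in this chest X-ray."},
    ]}
]
inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device, dtype=torch.bfloat16)

generated = model.generate(**inputs, max_new_tokens=200)
print(processor.decode(generated[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```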
merve posted an update 11 days ago
You can translate this post 🤗💗
merve posted an update 11 days ago
'tis the year of any-to-any/omni models 🤠
ByteDance-Seed/BAGEL-7B-MoT is a 7B native multimodal model that understands and generates both image + text

it outperforms leading VLMs like Qwen2.5-VL 👏 and has an Apache 2.0 license 😱
merve in huggingfacejs/tasks 12 days ago

image-to-video

#7 opened 13 days ago by multimodalart
merve posted an update 13 days ago
NVIDIA released a new vision reasoning model for robotics: Cosmos-Reason1-7B 🤖 nvidia/cosmos-reason1-67c9e926206426008f1da1b7

> first reasoning model for robotics
> based on Qwen2.5-VL-7B, use with Hugging Face transformers or vLLM 🤗 (rough sketch below)
> comes with SFT & alignment datasets and a new benchmark 👏
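for the vLLM route, a rough sketch (the checkpoint id and image URL are placeholders; vLLM's chat API takes OpenAI-style image_url content for Qwen2.5-VL-style models):

```python
from vllm import LLM, SamplingParams

# checkpoint id assumed from the Cosmos-Reason1 collection linked above
llm = LLM(model="nvidia/Cosmos-Reason1-7B", limit_mm_per_prompt={"image": 1})
sampling = SamplingParams(temperature=0.6, max_tokens=512)

messages = [
    {"role": "user", "content": [
        # placeholder image of the scene the robot is looking at
        {"type": "image_url",
         "image_url": {"url": "https://example.com/robot_workspace.jpg"}},
        {"type": "text",
         "text": "Is it safe for the robot arm to grasp the mug right now? Reason step by step."},
    ]}
]

outputs = llm.chat(messages, sampling)
print(outputs[0].outputs[0].text)
```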
merve posted an update 14 days ago
It was the week of video generation at @huggingface, on top of many new LLMs, VLMs and more!
Let's have a wrap 🌯 merve/may-16-releases-682aeed23b97eb0fe965345c

LLMs 💬
> Alibaba Qwen released WorldPM-72B, a new World Preference Model trained with 15M preference samples (OS)
> II-Medical-8B, a new 8B LLM for medical reasoning by Intelligent-Internet
> TRAIL is a new dataset by Patronus for trace error reasoning for agents (OS)

Multimodal 🖼️💬
> Salesforce Research released BLIP3o, a new any-to-any model with image-text input and image-text output 💬 it's based on an image encoder, a text decoder and a DiT, and comes in 8B
> They also released pre-training and fine-tuning datasets
> MMMG is a multimodal generation benchmark for image, audio, text (interleaved)

Image Generation ⏯️
> Alibaba Wan-AI released Wan2.1-VACE, a video foundation model for image- and text-to-video, video-to-audio and more tasks; comes in 1.3B and 14B (OS)
> ZuluVision released MoviiGen1.1, a new cinematic video generation model based on Wan 2.1 14B (OS)
> multimodalart released isometric-skeumorphic-3d-bnb, an isometric 3D asset generator (like Airbnb assets) based on Flux
> LTX-Video-0.9.7-distilled is a new real-time video generation (text- and image-to-video) model by Lightricks
> Hidream_t2i_human_preference is a new text-to-image preference dataset by Rapidata with 195k human responses from 38k annotators

Audio 🗣️
> stabilityai released stable-audio-open-small, a new text-to-audio model
> TEN-framework released ten-vad, a voice activity detection model (OS)

merve posted an update 17 days ago
New SOTA open-source depth estimation: Marigold v1-1 🌼

> normal maps, depth maps of scenes & faces prs-eth/marigold-normals prs-eth/marigold
> get albedo (true color) and BRDF (texture) maps of scenes prs-eth/marigold-intrinsics
> they even released a depth-to-3D-printer-format demo 😮 prs-eth/depth-to-3d-print

All models are here: prs-eth/marigold-computer-vision-6669e9e3d3ee30f48214b9ba
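a minimal sketch with the diffusers Marigold pipeline (the v1-1 depth checkpoint id and the input image are assumptions):

```python
import torch
import diffusers
from diffusers.utils import load_image

# depth checkpoint id for the v1-1 release is an assumption; see the collection above
pipe = diffusers.MarigoldDepthPipeline.from_pretrained(
    "prs-eth/marigold-depth-v1-1", torch_dtype=torch.float16
).to("cuda")

# placeholder input image (any RGB photo of a scene or face)
image = load_image("scene.jpg")
depth = pipe(image)

# colorize the predicted depth map and save it
vis = pipe.image_processor.visualize_depth(depth.prediction)
vis[0].save("scene_depth.png")
```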
merve posted an update 21 days ago
VLMs 2025 UPDATE 🔥

We just shipped a blog post covering all the latest on vision language models, including
🤖 GUI agents, agentic VLMs, omni models
📑 multimodal RAG
⏯️ video LMs
🤏🏻 smol models
...and more! https://huggingface.co/blog/vlms-2025
merve posted an update 27 days ago
A ton of impactful models and datasets landed in open-source AI this past week; let's summarize the best 🤩 merve/releases-apr-21-and-may-2-6819dcc84da4190620f448a3

💬 Qwen made it rain! They released Qwen3: new dense and MoE models ranging from 0.6B to 235B 🤯 as well as Qwen2.5-Omni, an any-to-any model in 3B and 7B!
> Microsoft AI released Phi4 reasoning models (that also come in mini and plus sizes)
> NVIDIA released new CoT reasoning datasets
🖼️ > ByteDance released UI-TARS-1.5, a native multimodal UI parsing agentic model
> Meta released EdgeTAM, an on-device object tracking model (SAM2 variant)
🗣️ NVIDIA released parakeet-tdt-0.6b-v2, a smol 600M automatic speech recognition model
> Nari released Dia, a 1.6B text-to-speech model
> Moonshot AI released Kimi Audio, a new audio understanding, generation and conversation model
👩🏻‍💻 JetBrains released Mellum models in base and SFT variants for coding
> Tesslate released UIGEN-T2-7B, a new text-to-frontend-code model 🤩
merve posted an update 28 days ago
A real-time object detector that is much faster and more accurate than YOLO, with an Apache 2.0 license, just landed in Hugging Face transformers 🔥

D-FINE is a SOTA real-time object detector that runs on a T4 (free Colab) 🤩

> Collection with all checkpoints and demo ustc-community/d-fine-68109b427cbe6ee36b4e7352

Notebooks:
> Tracking https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_tracking.ipynb
> Inference https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_inference.ipynb
> Fine-tuning https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_finetune_on_a_custom_dataset.ipynb
h/t @vladislavbro @qubvel-hf @ariG23498 and the authors of the paper 🎩

Regular object detectors try to predict bounding boxes as exact (x, y, w, h) pixel coordinates, which is a very rigid target and hard to optimize 🥲☹️

D-FINE instead formulates object detection as predicting a distribution over bounding box coordinates and refines it iteratively, which makes it more accurate 🤩

Another core idea behind this model is Global Optimal Localization Self-Distillation ⤵️

the model uses the final layer's distribution output (sort of like a teacher) and distills it into earlier layers to make them more performant.
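a minimal transformers inference sketch (the checkpoint id under ustc-community is an assumption; see the notebooks above for tracking and fine-tuning):

```python
import torch
from PIL import Image
from transformers import AutoImageProcessor, AutoModelForObjectDetection

# checkpoint id assumed from the ustc-community D-FINE collection above
model_id = "ustc-community/dfine-medium-coco"
image_processor = AutoImageProcessor.from_pretrained(model_id)
model = AutoModelForObjectDetection.from_pretrained(model_id)

# placeholder input image
image = Image.open("street.jpg")
inputs = image_processor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# map predictions back to the original image size and keep confident detections
results = image_processor.post_process_object_detection(
    outputs, target_sizes=[(image.height, image.width)], threshold=0.5
)[0]
for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
    print(f"{model.config.id2label[label.item()]}: {score.item():.2f} {box.tolist()}")
```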

merve posted an update about 1 month ago