Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1, 2024 • 72
DeepSeek R1 (All Versions) Collection DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 2 hours ago • 145
view article Article 🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 • 7 days ago • 13
view article Article Explore, Curate and Vector Search Any Hugging Face Dataset with Nomic Atlas By MaxNomic • 13 days ago • 29
Transformers.js demos Collection A collection of my favorite WebML demos, built with Transformers.js! • 30 items • Updated Jul 11, 2024 • 104
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 17 days ago • 13
Jan 17 Releases ❄️ Collection Models and datasets of the second week of Jan 2025. • 23 items • Updated 19 days ago • 10
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 21 days ago • 132
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 20 days ago • 63