Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 4 days ago • 7
Article Distributed SFT with trl and DeepSpeed Part 1: Starting Locally By jlzhou • 6 days ago • 1
Article How to Expand Your AI Music Generations of 30 Seconds to Several Minutes By theeseus-ai • Dec 13, 2024 • 4
Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 9 days ago • 13
Article Yay! Organizations can now publish blog Articles By huggingface • 8 days ago • 30
Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 8 days ago • 52
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 6 days ago • 245
Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 37
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 14 days ago • 50
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3, 2024 • 98
Article Train 400x faster Static Embedding Models with Sentence Transformers 14 days ago • 126
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Paper • 2501.01028 • Published 27 days ago • 13
Article Recipe: Preparing Multilingual Speech Datasets for TTS Training By PHBJT • Nov 4, 2024 • 16
Article Deploying Language Models on Azure Kubernetes: A Complete Beginner's Guide By vpkprasanna • Nov 11, 2024 • 3