view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 By tomaarsen • Mar 26 • 157
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 15 days ago • 465
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 15 days ago • 26
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published 26 days ago • 39
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 22 days ago • 155
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation • 15B • Updated 20 days ago • 37.6k • 83
view article Article Sensitivity Aware Mixed Precision Quantization V1 By badaoui and 1 other • Jun 13 • 19
view article Article How Long Prompts Block Other Requests - Optimizing LLM Performance By tngtech • Jun 12 • 5
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23 • 13
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23 • 13 • 2