view article Article Build an AI Shopping Assistant with Gradio MCP Servers By freddyaboulton • 3 days ago • 23
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 5 days ago • 131
view article Article Improving Parquet Dedupe on Hugging Face Hub By yuchenglow and 1 other • Oct 5, 2024 • 38
view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • 11 days ago • 31
view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events By vinid and 6 others • 17 days ago • 28
view article Article <p style="text-align:center;"> Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs)</p> By hba123 and 2 others • 21 days ago • 11
view article Article Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data By Pclanglais • Apr 18, 2024 • 23
Speech Evals Collection Synthesized speech evals generated by MistralAI from popular text evaluation datasets to evaluate spoken-language reasoning capabilities of Audio LLMs • 3 items • Updated 16 days ago • 5
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling Paper • 2506.12570 • Published Jun 14 • 1
view article Article Introducing ColQwen-Omni: Retrieve in every modality By manu and 4 others • 17 days ago • 59
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 15 items • Updated 5 days ago • 82
view article Article Building the Hugging Face MCP Server By evalstate and 3 others • 24 days ago • 54
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published 27 days ago • 19
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 26 days ago • 604
view article Article SmolVLM - small yet mighty Vision Language Model By andito and 4 others • Nov 26, 2024 • 344
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 283
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language Paper • 2506.20920 • Published Jun 26 • 64