MiniCPM4 Collection MiniCPM4: Ultra-Efficient LLMs on End Devices • 20 items • Updated about 11 hours ago • 58
Article Interactive Tools for machine learning, deep learning, and math By Suzana • 20 days ago • 44
INTELLECT-2 Collection INTELLECT-2 is a 32 billion parameter language model with globally distributed reinforcement learning. • 3 items • Updated May 11 • 22
Article How to Build an MCP Server with Gradio By abidlabs and 1 other • Apr 30 • 171
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 17 days ago • 153
Article Cohere on Hugging Face Inference Providers 🔥 By burtenshaw and 6 others • Apr 16 • 126
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated Apr 12 • 65
Sky-T1-7B Collection A series of 7B models trained with different recipes and the corresponding training data. • 8 items • Updated Feb 14 • 7
Light-R1 Collection Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond • 7 items • Updated Mar 13 • 12
Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 483
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases Paper • 2412.04862 • Published Dec 6, 2024 • 51