Hibiki fr-en Collection Hibiki is a model for streaming speech translation, which can run on device! See https://github.com/kyutai-labs/hibiki. • 7 items • Updated 13 days ago • 55
Cosmos Collection ⚠️ This collection is archived. See https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 31 items • Updated about 4 hours ago • 299
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 241
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 50 items • Updated 25 days ago • 136
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 16 items • Updated 13 days ago • 242
Octopus v2: On-device language model for super agent Paper • 2404.01744 • Published Apr 2, 2024 • 58
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models Paper • 2402.07033 • Published Feb 10, 2024 • 18
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper • 2401.09417 • Published Jan 17, 2024 • 62
ML for Tools Collection Collection of papers about ML for using tools! • 25 items • Updated Jan 17, 2024 • 10
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation Paper • 2401.04468 • Published Jan 9, 2024 • 49
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 65
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers Paper • 2312.03694 • Published Dec 6, 2023 • 2