Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published 5 days ago • 17
JudgeBench: A Benchmark for Evaluating LLM-based Judges Paper • 2410.12784 • Published 24 days ago • 42
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning Paper • 2410.02884 • Published Oct 3 • 48
Model Depot Collection Leading generative models packaged in OpenVINO format optimized for use on AI PCs • 50 items • Updated 13 days ago • 5
Functionary V3.2 Collection Fine-tuning Llama-3.1 using our own prompt template for function calling • 3 items • Updated 25 days ago • 1
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 5 days ago • 160
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published 19 days ago • 42
Pangea Collection A Fully Open Multilingual Multimodal LLM for 39 Languages • 18 items • Updated 8 days ago • 17
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit Paper • 2312.09911 • Published Dec 15, 2023 • 53
Notus 7B v1 Collection Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Jul 30 • 18
Llama 3.2 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.2 models, including the configurations • 4 items • Updated Sep 25 • 20
Zephyr ORPO Collection Models and datasets to align LLMs with Odds Ratio Preference Optimisation (ORPO). Recipes here: https://github.com/huggingface/alignment-handbook • 3 items • Updated Apr 12 • 17
Oryx-1.5 Collection Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution • 2 items • Updated 18 days ago • 3