EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation Paper • 2410.21271 • Published Oct 28, 2024 • 7
L1 Collection L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 2 items • Updated 27 days ago • 5
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning Paper • 2503.04697 • Published 28 days ago • 3
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond Paper • 2503.21614 • Published 7 days ago • 32
Flux.1-dev ControlNets Collection A collection of ControlNet models for Flux.1-dev by Jasper Research • 4 items • Updated Sep 24, 2024 • 21
ITVTON:Virtual Try-On Diffusion Transformer Model Based on Integrated Image and Text Paper • 2501.16757 • Published Jan 28 • 2
Qwen QwQ-32B Collection Collection Qwen's reasoning models including QwQ (32B) & QVQ (72B) in formats: GGUF, dynamic 4-bit and 16-bit original versions. • 13 items • Updated 3 days ago • 5
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published 14 days ago • 33
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers Paper • 2503.14487 • Published 16 days ago • 27
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published 14 days ago • 42
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 14 days ago • 46
Open-RS Collection Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t" • 8 items • Updated 13 days ago • 11
Process Reward Models Collection Model and Datasets for Qwen 2.5 Math PRM 7B • 6 items • Updated Feb 18 • 2
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline Paper • 2406.11939 • Published Jun 17, 2024 • 7