Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 10 days ago • 163
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published Jan 30 • 61
Article FuseO1-Preview: System-II Reasoning Fusion of LLMs By Wanfq and 4 others • Jan 20 • 21
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models Paper • 2501.11873 • Published Jan 21 • 66
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 79
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers Paper • 2408.06195 • Published Aug 12, 2024 • 74
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 680
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 1 day ago • 163