A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency • Paper • 2505.01658 • Published 10 days ago • 30
Towards Effective Extraction and Evaluation of Factual Claims • Paper • 2502.10855 • Published Feb 15 • 1
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory • Paper • 2504.19413 • Published 15 days ago • 12
WebThinker: Empowering Large Reasoning Models with Deep Research Capability • Paper • 2504.21776 • Published 13 days ago • 45
Unsloth Dynamic 2.0 Quants • Collection • New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 29 items • Updated 12 days ago • 88
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models • Paper • 2504.17789 • Published 19 days ago • 23
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models • Paper • 2504.15716 • Published 21 days ago • 9
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs • Paper • 2504.18415 • Published 18 days ago • 41
Qwen3 • Collection • Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated about 18 hours ago • 140
Describe Anything • Collection • Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 3 days ago • 49
Describe Anything: Detailed Localized Image and Video Captioning • Paper • 2504.16072 • Published 21 days ago • 61