Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models Paper • 2506.06751 • Published 19 days ago • 72
Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation Paper • 2505.18875 • Published May 24 • 40
Autellix: An Efficient Serving Engine for LLM Agents as General Programs Paper • 2502.13965 • Published Feb 19 • 19
DiskGNN: Bridging I/O Efficiency and Model Accuracy for Out-of-Core GNN Training Paper • 2405.05231 • Published May 8, 2024 • 1
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Paper • 2502.09411 • Published Feb 13 • 21
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks Paper • 2502.08235 • Published Feb 12 • 58
NanoFlow: Towards Optimal Large Language Model Serving Throughput Paper • 2408.12757 • Published Aug 22, 2024 • 18