High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Paper • 2507.05920 • Published 4 days ago • 11
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published 3 days ago • 64
view article Article Transformers backend integration in SGLang By marcsun13 and 4 others • 19 days ago • 44
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 11 days ago • 87
Tar Collection Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated 10 days ago • 14
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback Paper • 2507.02321 • Published 9 days ago • 38
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 9 days ago • 45
WebSailor: Navigating Super-human Reasoning for Web Agent Paper • 2507.02592 • Published 9 days ago • 90
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation Paper • 2506.21546 • Published 15 days ago • 2
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective Paper • 2507.01925 • Published 9 days ago • 30
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published 11 days ago • 64
Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published 17 days ago • 38
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper • 2507.01006 • Published 10 days ago • 179
view article Article Bringing Fusion Down to Earth: ML for Stellarator Optimization By cgeorgiaw • 10 days ago • 64