Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper β’ 2505.20793 β’ Published May 27 β’ 11
InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation Paper β’ 2407.06423 β’ Published Jul 8, 2024
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper β’ 2503.15661 β’ Published Mar 19 β’ 2
StarFlow: Generating Structured Workflow Outputs From Sketch Images Paper β’ 2503.21889 β’ Published Mar 27 β’ 1
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper β’ 2505.20793 β’ Published May 27 β’ 11
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper β’ 2505.20793 β’ Published May 27 β’ 11
Distilling semantically aware orders for autoregressive image generation Paper β’ 2504.17069 β’ Published Apr 23 β’ 6
Distilling semantically aware orders for autoregressive image generation Paper β’ 2504.17069 β’ Published Apr 23 β’ 6
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper β’ 2502.01341 β’ Published Feb 3 β’ 39
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks Paper β’ 2412.04626 β’ Published Dec 5, 2024 β’ 14
StarVector: Generating Scalable Vector Graphics Code from Images Paper β’ 2312.11556 β’ Published Dec 17, 2023 β’ 36