Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge • Paper • 2502.16457 • Published 4 days ago • 10
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models • Paper • 2502.14302 • Published 7 days ago • 8
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence • Paper • 2502.14905 • Published 8 days ago • 9
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding • Paper • 2502.14949 • Published 6 days ago • 6
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation • Paper • 2502.13995 • Published 8 days ago • 8
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models • Paper • 2502.15086 • Published 6 days ago • 14
VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues • Paper • 2502.12084 • Published 9 days ago • 29
SIFT: Grounding LLM Reasoning in Contexts via Stickers • Paper • 2502.14922 • Published 7 days ago • 28
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers • Paper • 2502.15007 • Published 6 days ago • 139
LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models • Paper • 2502.14834 • Published 6 days ago • 23
Rethinking Diverse Human Preference Learning through Principal Component Analysis • Paper • 2502.13131 • Published 8 days ago • 34
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention • Paper • 2502.11089 • Published 11 days ago • 134
Learning Getting-Up Policies for Real-World Humanoid Robots • Paper • 2502.12152 • Published 9 days ago • 36
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation • Paper • 2502.13145 • Published 8 days ago • 35
SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models • Paper • 2502.12464 • Published 9 days ago • 27
Soundwave: Less is More for Speech-Text Alignment in LLMs • Paper • 2502.12900 • Published 8 days ago • 76
ibm-granite/granite-vision-3.1-2b-preview • Image-Text-to-Text • Updated about 7 hours ago • 12.3k • 85