Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 2 days ago • 8
Thinking Beyond Tokens: From Brain-Inspired Intelligence to Cognitive Foundations for Artificial General Intelligence and its Societal Impact Paper • 2507.00951 • Published 3 days ago • 15
Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images Paper • 2506.22960 • Published 6 days ago • 4
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations Paper • 2506.13901 • Published 18 days ago • 3
Can Large Language Models Infer Causal Relationships from Real-World Text? Paper • 2505.18931 • Published May 25 • 1
Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods Paper • 2505.17870 • Published May 23 • 5
From Fog to Failure: How Dehazing Can Harm Clear Image Object Detection Paper • 2502.02027 • Published Feb 4 • 1
YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment Paper • 2502.03512 • Published Feb 5 • 5
Multilingual State Space Models for Structured Question Answering in Indic Languages Paper • 2502.01673 • Published Feb 1 • 2
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding Paper • 2501.15747 • Published Jan 27 • 7
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data Paper • 2501.08167 • Published Jan 14 • 6
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval Paper • 2412.15443 • Published Dec 19, 2024 • 10
Improving speaker verification robustness with synthetic emotional utterances Paper • 2412.00319 • Published Nov 30, 2024 • 2
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published Dec 1, 2024 • 4
Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI) Paper • 2411.16754 • Published Nov 24, 2024 • 4
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models Paper • 2411.10867 • Published Nov 16, 2024 • 10
Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types Paper • 2409.09269 • Published Sep 14, 2024 • 9
Density Adaptive Attention-based Speech Network: Enhancing Feature Understanding for Mental Health Disorders Paper • 2409.00391 • Published Aug 31, 2024 • 4