Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 2 days ago • 8 • 1
Peccavi: Visual Paraphrase Attack Safe and Distortion Free Image Watermarking Technique for AI-Generated Images Paper • 2506.22960 • Published 6 days ago • 4 • 1
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations Paper • 2506.13901 • Published 18 days ago • 3 • 2
Can Large Language Models Infer Causal Relationships from Real-World Text? Paper • 2505.18931 • Published May 25 • 1 • 2
Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods Paper • 2505.17870 • Published May 23 • 5 • 2
YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment Paper • 2502.03512 • Published Feb 5 • 5 • 2
IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding Paper • 2501.15747 • Published Jan 27 • 7 • 2
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data Paper • 2501.08167 • Published Jan 14 • 6 • 2
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Paper • 2501.03271 • Published Jan 5 • 10 • 2
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval Paper • 2412.15443 • Published Dec 19, 2024 • 10 • 2
Improving speaker verification robustness with synthetic emotional utterances Paper • 2412.00319 • Published Nov 30, 2024 • 2 • 2
Exploring the Abilities of Large Language Models to Solve Proportional Analogies via Knowledge-Enhanced Prompting Paper • 2412.00869 • Published Dec 1, 2024 • 4 • 2
Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI) Paper • 2411.16754 • Published Nov 24, 2024 • 4 • 2
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12 • 2
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models Paper • 2411.10867 • Published Nov 16, 2024 • 10 • 4
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models Paper • 2411.10867 • Published Nov 16, 2024 • 10 • 4
DM-Codec: Distilling Multimodal Representations for Speech Tokenization Paper • 2410.15017 • Published Oct 19, 2024 • 2 • 2