IndicMMLU-Pro: Benchmarking Indic Large Language Models on Multi-Task Language Understanding
Paper • 2501.15747 • Published 25 days ago
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
Paper • 2501.03271 • Published Jan 5