J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning Paper • 2505.10320 • Published May 15, 2025 • 23
Adaptive Decoding via Latent Preference Optimization Paper • 2411.09661 • Published Nov 14, 2024 • 10
Investigating Decoder-only Large Language Models for Speech-to-text Translation Paper • 2407.03169 • Published Jul 3, 2024 • 11