ilia kulikov's picture

6

ilia kulikov

uralik

·

uralik

AI & ML interests

None yet

Organizations

authored a paper 3 months ago

J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning

Paper • 2505.10320 • Published May 15 • 23

authored a paper 9 months ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

authored 2 papers about 1 year ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5, 2024 • 30

Investigating Decoder-only Large Language Models for Speech-to-text Translation

Paper • 2407.03169 • Published Jul 3, 2024 • 11