XXXXXXXLLM's picture

1 1 13

XXXXXXXLLM

TTTXXX01

·

AI & ML interests

None yet

Recent Activity

published a model about 5 hours ago

TTTXXX01/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

updated a model 1 day ago

TTTXXX01/Qwen2.5-1.5B-Open-R1-GRPO

updated a model 2 days ago

TTTXXX01/Qwen2.5-1.5B-Open-R1-Distill

View all activity

Organizations

None yet

TTTXXX01's activity

commented a paper 10 months ago

Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 27 •