Quanquan Gu's picture

Quanquan Gu

thughost

·

QuanquanGu

AI & ML interests

None yet

Organizations

Posts 2

Post

767

We've open-sourced the code and models for Self-Play Preference Optimization (SPPO)! 🚀🚀🚀
🤗paper: Self-Play Preference Optimization for Language Model Alignment (2405.00675)
⭐ code: https://github.com/uclaml/SPPO
🤗models: UCLA-AGI/sppo-6635fdd844f2b2e4a94d0b9a

Post

Check out the demo of SPIN-Diffusion made by @angelahzyuan at: UCLA-AGI/SPIN-Diffusion-demo-v1

Papers 30

arxiv:2512.24354

arxiv:2501.06425

arxiv:2411.10438

arxiv:2410.13782

models 0

None public yet

datasets 0

None public yet