
Quanquan Gu

thughost
  • QuanquanGu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago
On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
authored a paper 6 months ago
Tensor Product Attention Is All You Need
upvoted a paper 6 months ago
Tensor Product Attention Is All You Need

Organizations

  • math-dataset
  • UCLA Statistical Machine Learning Lab
  • UCLA Artificial General Intelligence Lab
  • Social Post Explorers
  • General Preference
  • Singularity AGI

Posts 2

Post
We've open-sourced the code and models for Self-Play Preference Optimization (SPPO)! 🚀🚀🚀
🤗paper: Self-Play Preference Optimization for Language Model Alignment (2405.00675)
⭐ code: https://github.com/uclaml/SPPO
🤗models: UCLA-AGI/sppo-6635fdd844f2b2e4a94d0b9a
Post
Check out the demo of SPIN-Diffusion made by @angelahzyuan at: UCLA-AGI/SPIN-Diffusion-demo-v1

Papers 29

arxiv:2501.06425
arxiv:2411.10438
arxiv:2410.13782
arxiv:2410.02712

Models 0

None public yet

Datasets 0

None public yet