Runlong Zhou

vectorzhou

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation
updated a dataset about 1 month ago
vectorzhou/AIME_2025_DeepSeek_R1_0528_Temp_1.0_L_16384
published a dataset about 1 month ago
vectorzhou/AIME_2025_DeepSeek_R1_0528_Temp_1.0_L_16384

Organizations

None yet

authored 4 papers 4 months ago

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Paper • 2310.19308 • Published Oct 30, 2023

Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes

Paper • 2210.11604 • Published Oct 20, 2022

Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques

Paper • 2409.00717 • Published Sep 1, 2024

Extragradient Preference Optimization (EGPO): Beyond Last-Iterate Convergence for Nash Learning from Human Feedback

Paper • 2503.08942 • Published Mar 11, 2025