Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wei Fu's picture
2

Wei Fu

garrett4wade
21world's profile picture
·
  • garrett4wade

AI & ML interests

RL

Recent Activity

upvoted a paper about 1 month ago
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments
upvoted a paper about 1 month ago
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning
updated a dataset 5 months ago
inclusionAI/AReaL-RL-Data
View all activity

Organizations

inclusionAI's profile picture

upvoted 2 papers about 1 month ago

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3 • 57

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Paper • 2505.24298 • Published May 30 • 26
updated a dataset 5 months ago

inclusionAI/AReaL-RL-Data

Preview • Updated Mar 28 • 528 • 3
authored 2 papers about 1 year ago

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Paper • 2306.16688 • Published Jun 29, 2023

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Paper • 2404.10719 • Published Apr 16, 2024 • 6
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs