Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wanicca's picture
1 3 16

wanicca

wanicca
21world's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago
UQ: Assessing Language Models on Unsolved Questions
liked a model 20 days ago
HKUSTAudio/Llasa-3B
upvoted a paper about 1 month ago
UloRL:An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities
View all activity

Organizations

Synthia's profile picture Tencent's profile picture Hugging Face Discord Community's profile picture

Collections 1

RL papers
  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published Apr 22 • 120
RL papers
  • TTRL: Test-Time Reinforcement Learning

    Paper • 2504.16084 • Published Apr 22 • 120

models 0

None public yet

datasets 1

wanicca/WikiHowQA-mnbvc

Viewer • Updated Sep 4, 2023 • 90.1k • 102 • 8
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs