Hugging Face

mobbb

mobbb0

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago
GY2233/Meta_llama_3.1_8b_instruct_expanded
liked a dataset 10 days ago
GY2233/code_generation_filtered

Organizations

None yet

mobbb0's activity

upvoted a paper 10 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 112
upvoted a collection 10 days ago

UltraIF series

Collection
Open-Sourced model and data for ULTRAIF: Advancing Instruction Following from the Wild. • 6 items • Updated Apr 3 • 2
upvoted 2 papers 10 days ago

UltraIF: Advancing Instruction Following from the Wild

Paper • 2502.04153 • Published Feb 6 • 24

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published 11 days ago • 68