Mashiro

AlexMashiro

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago
InfiMed-ORBIT: Aligning LLMs on Open-Ended Complex Tasks via Rubric-Based Incremental Training
Paper • 2510.15859 • Published Oct 17, 2025 • 11

upvoted a paper 5 days ago
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
Paper • 2508.16949 • Published Aug 23, 2025 • 23

upvoted a paper 30 days ago
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
Paper • 2511.19399 • Published Nov 24, 2025 • 60

upvoted a paper about 1 month ago
Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning
Paper • 2509.25534 • Published Sep 19, 2025 • 2

upvoted a paper about 2 months ago
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
Paper • 2509.21500 • Published Sep 25, 2025 • 18