Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shaobai Jiang's picture
2 8

Shaobai Jiang

shaobaij
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago
Kimi-VL Technical Report
upvoted a paper 27 days ago
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
upvoted a paper about 2 months ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
View all activity

Organizations

None yet

shaobaij's activity

upvoted a paper 22 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 124
upvoted a paper 27 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 84
upvoted 3 papers about 2 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 61

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 112

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published Mar 13 • 28
upvoted 2 papers 2 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 115

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 391
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs