sunblaze-ucb

https://github.com/sunblaze-ucb

AI & ML interests

None defined yet.

Recent Activity

dylanx26 updated a dataset 4 days ago

sunblaze-ucb/AgentSynth

Dongwei authored a paper 4 months ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Xuandong authored a paper 7 months ago

Learning to Reason without External Rewards

View all activity

Organization Card

Community About org cards

Open-source models and datasets from Dawn Song's research group at UC Berkeley. GitHub Repos

Collections 1

models 14

sunblaze-ucb/Qwen2.5-1.5B-Intuitor-MATH-1EPOCH

Text Generation • 2B • Updated Aug 13 • 19

sunblaze-ucb/OLMo-2-7B-SFT-GRPO-MATH-1EPOCH

Text Generation • 7B • Updated Aug 13 • 23

sunblaze-ucb/OLMo-2-7B-SFT-Intuitor-MATH-1EPOCH

Text Generation • 7B • Updated Aug 13 • 15

sunblaze-ucb/Qwen3-14B-Intuitor-MATH-1EPOCH

Text Generation • 15B • Updated Aug 13 • 27

sunblaze-ucb/Qwen2.5-1.5B-GRPO-MATH-1EPOCH

Text Generation • 2B • Updated Aug 13 • 14

sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH

Text Generation • 15B • Updated Aug 13 • 15

sunblaze-ucb/Qwen2.5-3B-Intuitor-MATH-1EPOCH

Text Generation • 3B • Updated Aug 13 • 18

sunblaze-ucb/Qwen2.5-3B-GRPO-MATH-1EPOCH

Text Generation • 3B • Updated Aug 13 • 13

sunblaze-ucb/Qwen2.5-14B-Intuitor-MATH-1EPOCH

Text Generation • 15B • Updated Jul 2 • 7

sunblaze-ucb/Qwen2.5-7B-Intuitor-MATH-1EPOCH

Text Generation • 8B • Updated Jul 2 • 10

datasets 6

sunblaze-ucb/cybergym-poc

Updated Nov 11 • 12

sunblaze-ucb/cybergym-source

Updated Nov 11 • 13

sunblaze-ucb/AgentSynth

Viewer • Updated Sep 1 • 1.21k • 226 • 5

sunblaze-ucb/verina

Viewer • Updated Aug 17 • 189 • 267 • 6

sunblaze-ucb/cybergym-server

Updated May 16 • 324

sunblaze-ucb/cybergym

Viewer • Updated May 15 • 1.51k • 8.96k • 10