LLM360

community

Verified

https://www.llm360.ai

llm360

llm360

AI & ML interests

None defined yet.

Recent Activity

zhoujun new activity 3 days ago

LLM360/guru-RL-92k:Add task category, library name, and link to paper and code

zhoujun new activity 3 days ago

LLM360/guru-RL-92k:Update README.md

SinclairWang authored a paper 3 days ago

MegaMath: Pushing the Limits of Open Math Corpora

View all activity

zhoujun

in LLM360/guru-RL-92k 3 days ago

Add task category, library name, and link to paper and code

#3 opened 11 days ago by

Update README.md

#12 opened 3 days ago by

SinclairWang

authored 2 papers 3 days ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 31

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published 5 days ago • 34

koalazf99

authored a paper 3 days ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published 5 days ago • 34

zhoujun

updated a dataset 3 days ago

LLM360/guru-RL-92k

Viewer • Updated 3 days ago • 91.9k • 428 • 14

zhoujun

in LLM360/guru-RL-92k 3 days ago

Test set is empty?

#10 opened 9 days ago by

Update README.md

#11 opened 3 days ago by

zhoujun

updated 2 datasets 4 days ago

LLM360/guru-RL-92k-extra-info-compressed

Viewer • Updated 4 days ago • 105k • 129

LLM360/guru_preview

Viewer • Updated 4 days ago • 91.9k • 51

twkillian

authored a paper 7 days ago

Medical Dead-ends and Learning to Identify High-risk States and Treatments

Paper • 2110.04186 • Published Oct 8, 2021

moonfolk

authored a paper 7 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 12 days ago • 43

realYuanLi

authored a paper 7 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 12 days ago • 43

twkillian

authored a paper 7 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 12 days ago • 43

koalazf99

authored a paper 10 days ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 12 days ago • 43

zhoujun

in LLM360/guru-RL-92k 10 days ago

Delete offline_eval/math__aime2025_repeated_8x_240.parquet

#9 opened 10 days ago by

Delete online_eval/logic__graph_logical_dataset_77.parquet

#8 opened 10 days ago by

Delete online_eval/codegen__leetcode2k_386.parquet

#7 opened 10 days ago by

Delete online_eval/math__minerva_272.parquet

#6 opened 10 days ago by

Rename online_eval/table__hitab_300.parquet to online_eval/table__hitab_200.parquet

#5 opened 10 days ago by