PRIME-RL (PRIME)

ramiroluo

submitted a paper to Daily Papers 3 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 37

ramiroluo

in PRIME-RL/P1-VL-30B-A3B 3 months ago

Add metadata and link to paper/code

#1 opened 3 months ago by

nielsr

ramiroluo

in PRIME-RL/P1-VL-235B-A22B 3 months ago

Add metadata and links to paper and code

#1 opened 3 months ago by

nielsr

ramiroluo

authored 2 papers 3 months ago

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 32

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

ramiroluo

updated a model 3 months ago

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 11 • 3

ramiroluo

published 2 models 3 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 29 • 3

PRIME-RL/P1-VL-235B-A22B

Image-Text-to-Text • 236B • Updated Feb 12 • 11 • 3

ramiroluo

updated a model 3 months ago

PRIME-RL/P1-VL-30B-A3B

Image-Text-to-Text • 31B • Updated Feb 12 • 29 • 3

JC-Chen

authored 5 papers 5 months ago

Symbol: Generating Flexible Black-Box Optimizers through Symbolic Equation Learning

Paper • 2402.02355 • Published Feb 4, 2024

LLaMoCo: Instruction Tuning of Large Language Models for Optimization Code Generation

Paper • 2403.01131 • Published Mar 2, 2024

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling

Paper • 2508.08636 • Published Aug 12, 2025 • 2

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Paper • 2509.07894 • Published Sep 9, 2025 • 32

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

stingning

updated a Space 6 months ago

README

🏃

JC-Chen

published a model 6 months ago

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24, 2025 • 239 • 10

JC-Chen

updated 2 models 6 months ago

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24, 2025 • 239 • 10

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24, 2025 • 20 • 20

JC-Chen

published a model 6 months ago

PRIME-RL/P1-235B-A22B

Text Generation • 235B • Updated Oct 24, 2025 • 20 • 20

ganqu

authored a paper 7 months ago

V-GameGym: Visual Game Generation for Code Large Language Models

Paper • 2509.20136 • Published Sep 24, 2025 • 8

AI & ML interests

Team members 8

PRIME-RL's activity

Add metadata and link to paper/code

Add metadata and links to paper and code

README