Konstantin Grotov

konstantgr

AI & ML interests

None yet

Recent Activity

updated a model 2 months ago

JetBrains-Research/Qwen3-30B-A3B-am

31B • Updated Oct 29, 2025 • 1

upvoted a paper 2 months ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27, 2025 • 20

authored a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 37

New activity in JetBrains-Research/PIPer-8B-RL-only 3 months ago

Improve model card: Add paper and code badges, update datasets metadata

#1 opened 3 months ago by

New activity in JetBrains-Research/PIPer-8B 3 months ago

Improve model card: Add paper and code links

#1 opened 3 months ago by

updated a collection 3 months ago

🦫 PIPer

All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3

upvoted a paper 3 months ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

Paper • 2509.25455 • Published Sep 29, 2025 • 37

updated a dataset 3 months ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2, 2025 • 742 • 157 • 1

published a dataset 3 months ago

JetBrains-Research/PIPer-envbench-zeroshot-rl

Viewer • Updated Oct 2, 2025 • 742 • 157 • 1

updated a dataset 3 months ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2, 2025 • 2.5k • 37 • 1

published a dataset 3 months ago

JetBrains-Research/PIPer-SFT-2500-sharegpt

Viewer • Updated Oct 2, 2025 • 2.5k • 37 • 1

updated a dataset 3 months ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30, 2025 • 716

published a dataset 3 months ago

JetBrains-Research/PIPer-eval

Preview • Updated Sep 30, 2025 • 716

updated a model 3 months ago

JetBrains-Research/Qwen3-8B-am

Text Generation • 8B • Updated Sep 30, 2025 • 101
