Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
26
21
seongyun_lee
Seongyun
Follow
daniel0098's profile picture
juyoungml's profile picture
samusenps's profile picture
7 followers
·
9 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
How to Train Your LLM Web Agent: A Statistical Diagnosis
upvoted
a
paper
9 days ago
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
authored
a paper
about 2 months ago
The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think
View all activity
Organizations
Papers
11
arxiv:
2505.10185
arxiv:
2504.17192
arxiv:
2503.19877
arxiv:
2412.03679
Expand 11 papers
models
39
Sort: Recently updated
Seongyun/exaone_deep_2.4b_non_math_only_mcqa_format
Updated
Apr 2
Seongyun/non_math_only_mcqa_format
2B
•
Updated
Mar 31
•
3
Seongyun/math_only_mcqa_format
2B
•
Updated
Mar 29
•
5
Seongyun/rlpvr-mcqa-only-small-full-opt20
Text Generation
•
2B
•
Updated
Mar 23
•
10
Seongyun/rlpvr-mcqa-only-unverifiable
Text Generation
•
2B
•
Updated
Mar 22
•
6
Seongyun/rlpvr-mcqa-only-small-full-opt10
2B
•
Updated
Mar 16
•
5
Seongyun/pretrained-rlpvr-v1.9-small-full
2B
•
Updated
Mar 15
•
3
Seongyun/instruction-tuned-rlpvr-v1.9-small-full
2B
•
Updated
Mar 14
•
3
Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_6
Updated
Mar 10
Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_5
2B
•
Updated
Mar 10
•
2
View 39 models
datasets
2
Sort: Recently updated
Seongyun/human_eval_1
Viewer
•
Updated
Apr 12
•
100
•
25
Seongyun/flan-zs-noopt-thought-aug
Updated
Dec 22, 2024
•
73