The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
HKUST NLP Group
university
AI & ML interests
None defined yet.
Recent Activity
The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
-
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 9 • 1 -
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 5 • 1 -
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6 -
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
The collection for the Paper "WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents"
The collection for the Paper "Pitfalls of Rule- and Model-based Verifiers: A Case Study on Mathematical Reasoning."
-
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 9 • 1 -
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 5 • 1 -
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 6 -
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 6
models
62
hkust-nlp/WebExplorer-8B
Image-Text-to-Text
•
8B
•
Updated
•
188
•
9
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning
•
8B
•
Updated
•
8
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning
•
8B
•
Updated
•
6
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning
•
8B
•
Updated
•
6
hkust-nlp/R1-Distill-Verifier-1.5B
2B
•
Updated
•
5
•
1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning
•
8B
•
Updated
•
9
•
1
hkust-nlp/Laser-DE-L4096-1.5B
2B
•
Updated
•
816
hkust-nlp/Laser-DE-L2048-1.5B
2B
•
Updated
•
6
hkust-nlp/Laser-DE-L1024-1.5B
2B
•
Updated
•
5
hkust-nlp/Laser-D-L4096-1.5B
2B
•
Updated
•
230
datasets
28
hkust-nlp/WebExplorer-QA
Viewer
•
Updated
•
100
•
286
•
4
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated
•
35
•
2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview
•
Updated
•
69
•
53
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer
•
Updated
•
6.12k
•
16
•
1
hkust-nlp/deepscaler_simplelr
Viewer
•
Updated
•
40.3k
•
25
hkust-nlp/Laser-Deepscaler-Dataset
Viewer
•
Updated
•
40.8k
•
86
hkust-nlp/LeetCode-O
Preview
•
Updated
•
52
hkust-nlp/GUIMid
Viewer
•
Updated
•
1.85M
•
240
•
5
hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
•
53.1k
•
321
•
6
hkust-nlp/PreSelect-100B
Viewer
•
Updated
•
54.5M
•
376
•
11