HKUST NLP Group

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ZhaoweiWang submitted a paper about 1 month ago

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

ksshumab authored a paper about 1 month ago

SWE-RM: Execution-free Feedback For Software Engineering Agents

ksshumab authored a paper about 1 month ago

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

View all activity

Papers

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

View all Papers

Collections 12

View 12 collections

models 66

datasets 32

hkust-nlp/drkernel-validation-data

Viewer • Updated Feb 6 • 100 • 59 • 1

hkust-nlp/drkernel-rl-data

Viewer • Updated Feb 6 • 72k • 52

hkust-nlp/drkernel-coldstart-8k

Viewer • Updated Feb 6 • 8.92k • 46 • 2

hkust-nlp/Toolathlon-Trajectories

Preview • Updated Dec 5, 2025 • 3.35k • 21

hkust-nlp/WebExplorer-QA

Viewer • Updated Nov 22, 2025 • 100 • 119 • 7

hkust-nlp/CodeIO-PyEdu-Reasoning-Raw

Updated Jun 18, 2025 • 64 • 2

hkust-nlp/CodeIO-PyEdu-Reasoning

Preview • Updated Jun 18, 2025 • 148 • 58

hkust-nlp/rl-verifier-pitfalls_hacking_data

Viewer • Updated May 28, 2025 • 6.12k • 17 • 1

hkust-nlp/deepscaler_simplelr

Viewer • Updated May 28, 2025 • 40.3k • 33

hkust-nlp/Laser-Deepscaler-Dataset

Viewer • Updated May 21, 2025 • 40.8k • 78

View 32 datasets

HKUST NLP Group

AI & ML interests

Recent Activity

Papers

Collections 12

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

hkust-nlp/drkernel-14b

hkust-nlp/drkernel-8b

hkust-nlp/drkernel-14b-coldstart

hkust-nlp/Toolathlon-Trajectories

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

hkust-nlp/drkernel-14b

hkust-nlp/drkernel-8b

hkust-nlp/drkernel-14b-coldstart

hkust-nlp/Toolathlon-Trajectories

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

models 66

hkust-nlp/drkernel-8b-coldstart

hkust-nlp/drkernel-14b-coldstart

hkust-nlp/drkernel-14b

hkust-nlp/drkernel-8b

hkust-nlp/WebExplorer-8B

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B

hkust-nlp/Qwen-2.5-7B-Verifier-HF

hkust-nlp/R1-Distill-Verifier-1.5B

hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B

datasets 32

hkust-nlp/drkernel-validation-data

hkust-nlp/drkernel-rl-data

hkust-nlp/drkernel-coldstart-8k

hkust-nlp/Toolathlon-Trajectories

hkust-nlp/WebExplorer-QA

hkust-nlp/CodeIO-PyEdu-Reasoning-Raw

hkust-nlp/CodeIO-PyEdu-Reasoning

hkust-nlp/rl-verifier-pitfalls_hacking_data

hkust-nlp/deepscaler_simplelr

hkust-nlp/Laser-Deepscaler-Dataset

AI & ML interests

Recent Activity

Papers

Team members 15

Collections 12

models 66 Sort: Recently updated

datasets 32 Sort: Recently updated

models 66

datasets 32