4 10 8

Wenqi Shi

wshi83

https://wshi83.github.io

AI & ML interests

LLMs, Generative AI, Data-Centric AI

Recent Activity

authored a paper about 1 month ago

CellForge: Agentic Design of Virtual Cell Models

authored a paper about 1 month ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

authored a paper about 1 month ago

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

View all activity

Organizations

authored 4 papers about 1 month ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4, 2025 • 39

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Paper • 2509.24193 • Published Sep 29, 2025 • 6

Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards

Paper • 2509.21882 • Published Sep 26, 2025

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published Nov 24, 2025 • 9

upvoted a paper about 1 month ago

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published Nov 24, 2025 • 9

commented a paper about 1 month ago

Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs

Paper • 2511.19773 • Published Nov 24, 2025 • 9 •

upvoted a paper 3 months ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Paper • 2509.24193 • Published Sep 29, 2025 • 6

liked a dataset 4 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24, 2025 • 9 • 1

liked 2 models 4 months ago

MedAgentGym/MedCopilot-7B

8B • Updated Jun 1, 2025 • 17 • 4

MedAgentGym/MedCopilot-14B

15B • Updated Jun 1, 2025 • 15 • 2

liked a dataset 4 months ago

MedAgentGym/SampledTrajs

Viewer • Updated Jun 1, 2025 • 21.4k • 2.1k • 4

upvoted a paper 5 months ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4, 2025 • 39

updated a dataset 5 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24, 2025 • 9 • 1

upvoted a paper 6 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

published a dataset 6 months ago

MedAgentGym/MedAgentGym-Data

Preview • Updated Jul 24, 2025 • 9 • 1

upvoted a paper 6 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8, 2025 • 75

liked a dataset 6 months ago

wshi83/EHRAgent-treqs

Updated Feb 13, 2024 • 46 • 3

authored 2 papers 7 months ago

Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration

Paper • 2504.04915 • Published Apr 7, 2025

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Paper • 2506.04405 • Published Jun 4, 2025 • 7

upvoted a paper 7 months ago

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Paper • 2506.04405 • Published Jun 4, 2025 • 7

Wenqi Shi

AI & ML interests

Recent Activity

Organizations

wshi83's activity