Zhiwei He

zwhe99

https://zwhe99.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 21 days ago

SkillNet: Create, Evaluate, and Connect AI Skills

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2.5

upvoted a paper about 2 months ago

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Organizations

None yet

upvoted a paper 21 days ago

SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published 29 days ago • 91

upvoted a paper about 2 months ago

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Paper • 2601.17737 • Published Jan 25 • 56

upvoted a paper 5 months ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7, 2025 • 57

upvoted a collection 6 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 535

upvoted a paper 7 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 82

upvoted a paper 9 months ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8, 2025 • 76

upvoted a paper 10 months ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

upvoted a paper 11 months ago

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

Paper • 2505.02847 • Published May 1, 2025 • 30

upvoted 2 collections 11 months ago

Qwen3

84 items • Updated Dec 31, 2025 • 1.73k

DeepMath

A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning • 5 items • Updated May 22, 2025 • 5

upvoted 2 papers 11 months ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published Apr 22, 2025 • 64

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15, 2025 • 12

upvoted a paper about 1 year ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

upvoted a collection over 1 year ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 252