Lei Wang's picture

2 253

Lei Wang

demolei

·

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 3 days ago

A Survey of Context Engineering for Large Language Models

upvoted a paper 5 days ago

AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs

upvoted a paper 5 days ago

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

View all activity

Organizations

upvoted a paper 3 days ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published 3 days ago • 144

upvoted 3 papers 5 days ago

AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs

Paper • 2507.08616 • Published 10 days ago • 11

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published 10 days ago • 9

OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique

Paper • 2507.09075 • Published 9 days ago • 8

upvoted a paper 6 days ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published 13 days ago • 48

upvoted a collection 8 days ago

VisionLM

1338 items • Updated about 23 hours ago • 85

upvoted a paper 11 days ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published 12 days ago • 67

upvoted 3 papers 12 days ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published 12 days ago • 82

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published 12 days ago • 39

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Paper • 2507.03112 • Published 17 days ago • 31

upvoted 3 papers 20 days ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published 22 days ago • 60

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 20 days ago • 44

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published 29 days ago • 19

upvoted 2 papers 27 days ago

LettinGo: Explore User Profile Generation for Recommendation System

Paper • 2506.18309 • Published 28 days ago • 11

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 27 days ago • 56

upvoted 5 papers about 1 month ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 254

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12 • 71

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 66

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 98