Lei Wang

demolei

https://demoleiwang.github.io/HomePage/

AI & ML interests

LLMs

Recent Activity

upvoted a paper 2 days ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published 2 days ago • 122

upvoted 3 papers 4 days ago

AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs

Paper • 2507.08616 • Published 9 days ago • 9

A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning

Paper • 2507.08267 • Published 9 days ago • 9

OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique

Paper • 2507.09075 • Published 8 days ago • 6

upvoted a paper 5 days ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published 12 days ago • 47

upvoted a collection 7 days ago

VisionLM

1336 items • Updated 2 days ago • 84

upvoted a paper 10 days ago

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published 11 days ago • 67

upvoted 3 papers 11 days ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published 11 days ago • 82

CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization

Paper • 2507.06181 • Published 11 days ago • 39

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Paper • 2507.03112 • Published 16 days ago • 31

upvoted 3 papers 19 days ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published 21 days ago • 60

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 19 days ago • 44

Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective

Paper • 2506.17930 • Published 28 days ago • 19

upvoted 2 papers 26 days ago

LettinGo: Explore User Profile Generation for Recommendation System

Paper • 2506.18309 • Published 27 days ago • 11

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published 26 days ago • 56

upvoted 5 papers about 1 month ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 254

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12 • 71

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 66

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 98