Dawei Zhu

dwzhu

dwzhu-pku

AI & ML interests

natural language processing

Recent Activity

authored a paper about 10 hours ago

MiMo-VL Technical Report

upvoted a paper about 14 hours ago

MiMo-VL Technical Report

upvoted a collection 5 days ago

MiMo-VL

dwzhu's activity

upvoted a paper about 14 hours ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published 1 day ago • 53

upvoted a collection 5 days ago

MiMo-VL

Collection

2 items • Updated 7 days ago • 23

upvoted a paper 24 days ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published 24 days ago • 77

upvoted a paper 2 months ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

upvoted a paper 3 months ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published Mar 4 • 27

upvoted an article 7 months ago

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Article • Oct 24, 2024 • 12

upvoted 2 papers 8 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 76

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 32

upvoted a paper about 1 year ago

LongEmbed: Extending Embedding Models for Long Context Retrieval

Paper • 2404.12096 • Published Apr 18, 2024 • 2

upvoted a collection over 1 year ago

Attention

Collection

128 items • Updated Mar 15 • 4

upvoted a paper over 1 year ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 26

upvoted a collection over 1 year ago

Long context

Collection

94 items • Updated Sep 29, 2024 • 32