Zengzhi Wang's picture

Zengzhi Wang

SinclairWang

·

https://tinyurl.com/zengzhi-homepage

AI & ML interests

Data Engineering for Generative AI

Organizations

New activity in OctoThinker/MegaMath-Web-Pro-Max 11 months ago

Still uploading, please stay tuned.

#1 opened 12 months ago by

New activity in LLM360/MegaMath 11 months ago

Questions on Deduplication Strategy, Temporal Metadata and Representation of Structured Content

#8 opened about 1 year ago by

New activity in OctoThinker/MegaMath-Web-Pro-Max 12 months ago

Upload folder using huggingface_hub

#2 opened 12 months ago by

commented a paper 12 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25, 2025 • 49 •

New activity in finemath/final_soup about 1 year ago

[WIP] Upload folder using huggingface_hub (multi-commit 670c61ff427293798eb7c582171dc0d5b8d4ac5740b10f79640b6523327dad49)

#5 opened about 1 year ago by

Upload folder using huggingface_hub

#4 opened about 1 year ago by

Upload folder using huggingface_hub

#3 opened about 1 year ago by

[WIP] Upload folder using huggingface_hub (multi-commit bdc6a3514c54d904d43991be630b970fb11527b37131cabb4b9143452d9223b8)

#2 opened about 1 year ago by

Upload folder using huggingface_hub

#1 opened about 1 year ago by

commented a paper about 1 year ago

NEMOTRON-CROSSTHINK: Scaling Self-Learning beyond Math Reasoning

Paper • 2504.13941 • Published Apr 15, 2025 • 12 •

New activity in gair-prox/DCLM-pro over 1 year ago

Upload folder using huggingface_hub

#5 opened over 1 year ago by

New activity in GAIR/OlympicArenaSubmission almost 2 years ago

Add paper link to connect the Space to its paper on Daily Papers page

#1 opened almost 2 years ago by

New activity in GAIR/OlympicArena almost 2 years ago

Add paper link

#2 opened almost 2 years ago by

commented a paper almost 2 years ago

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Paper • 2406.16772 • Published Jun 24, 2024 • 2 •

New activity in GAIR/MathPile almost 2 years ago

arxiv macros and figure environment seem mis-handled

#1 opened over 2 years ago by

New activity in open-llm-leaderboard/open_llm_leaderboard about 2 years ago

💬 Discussion thread: Model contamination techniques 💬

#472 opened over 2 years ago by

commented a paper about 2 years ago

Benchmarking Benchmark Leakage in Large Language Models

Paper • 2404.18824 • Published Apr 29, 2024 • 6 •

New activity in open-llm-leaderboard/open_llm_leaderboard about 2 years ago

Tool: Space to test model contamination

#486 opened over 2 years ago by

New activity in GAIR/MathPile_Commercial over 2 years ago

CastError on dataset loading

#2 opened over 2 years ago by

Casting error when load the dataset?

#1 opened over 2 years ago by