OLMo 2 Preview Post-trained Models (Collection • 6 items). These models' tokenizer did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied; this is resolved in the latest versions (a minimal check is sketched after this list).
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (Paper • arXiv:2502.02737 • Published Feb 4).
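For context on the tokenizer note above, here is a minimal sketch of how the slow/fast pre-tokenization difference can be surfaced with the `transformers` `AutoTokenizer` API. The model id is an illustrative placeholder (not a guaranteed hub id), and it assumes the checkpoint ships both a slow (Python) and a fast (Rust-backed) tokenizer; checkpoints without a slow implementation will raise on `use_fast=False`.

```python
# Sketch: compare slow vs. fast tokenizer output to spot the
# pre-tokenization variation described in the collection note.
from transformers import AutoTokenizer

MODEL_ID = "allenai/OLMo-2-example"  # placeholder id, substitute a real checkpoint

# Load both tokenizer implementations for the same checkpoint.
slow = AutoTokenizer.from_pretrained(MODEL_ID, use_fast=False)
fast = AutoTokenizer.from_pretrained(MODEL_ID, use_fast=True)

# Whitespace-heavy input tends to expose pre-tokenization differences.
text = "Pre-tokenization edge case:  double  spaces\tand tabs."
slow_tokens = slow.tokenize(text)
fast_tokens = fast.tokenize(text)

# On affected versions the two token streams can diverge;
# fixed versions should agree.
print("slow:", slow_tokens)
print("fast:", fast_tokens)
print("match:", slow_tokens == fast_tokens)
```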