4 24 184

PeijieDong

pprp

https://pprp.github.io

AI & ML interests

Model Compression; Large Language Model;

Recent Activity

liked a model 2 days ago

rednote-hilab/dots.llm1.inst

liked a dataset 2 days ago

a-m-team/AM-DeepSeek-Distilled-40M

liked a model 2 days ago

a-m-team/AM-Thinking-v1

View all activity

Organizations

None yet

pprp's activity

upvoted a paper 11 days ago

Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression

Paper • 2505.19433 • Published 13 days ago • 5

upvoted a collection 29 days ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated 20 days ago • 148

upvoted a paper about 2 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 268

upvoted a paper 4 months ago

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61

upvoted an article 4 months ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

and 1 other •

May 2, 2022

• 4

upvoted a paper 4 months ago

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Paper • 2502.04411 • Published Feb 6 • 4

upvoted an article 5 months ago

Article

Token Merging for fast LLM inference : Background and first trials with Mistral

•

Apr 30, 2024

• 4

upvoted 7 papers 8 months ago

Should We Really Edit Language Models? On the Evaluation of Edited Language Models

Paper • 2410.18785 • Published Oct 24, 2024 • 7

FlatQuant: Flatness Matters for LLM Quantization

Paper • 2410.09426 • Published Oct 12, 2024 • 15

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Paper • 2410.10819 • Published Oct 14, 2024 • 7

LPZero: Language Model Zero-cost Proxy Search from Zero

Paper • 2410.04808 • Published Oct 7, 2024 • 2

Benchmarking Agentic Workflow Generation

Paper • 2410.07869 • Published Oct 10, 2024 • 27

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Paper • 2410.05265 • Published Oct 7, 2024 • 32

LongGenBench: Long-context Generation Benchmark

Paper • 2410.04199 • Published Oct 5, 2024 • 22

upvoted an article 10 months ago

Article

LLM Data Engineering 3——Data Collection Magic: Acquiring Top Training Data

•

Jun 4, 2024

• 4

upvoted a collection 10 months ago

Google Gemma2

Collection

24 items • Updated Oct 22, 2024 • 15

upvoted an article 10 months ago

Article

Welcome Gemma 2 - Google's new open LLM

and 5 others •

Jun 27, 2024

• 129

upvoted a collection 10 months ago

🪐 SmolLM

Collection

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 227

upvoted a paper 10 months ago

Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for Large Language Models

Paper • 2406.02924 • Published Jun 5, 2024 • 2

upvoted an article about 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

and 4 others •

May 24, 2023

• 152