dongXL (dong xiaolong)

upvoted a collection 8 months ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 544

upvoted an article about 1 year ago

Article

Uncensor any LLM with abliteration

mlabonne • Jun 13, 2024 • 857

upvoted a paper about 1 year ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published Mar 3, 2025 • 10

upvoted 2 articles over 1 year ago

Article

Zero to Hero with the TRL learning link bomb 💣

burtenshaw • Nov 25, 2024 • 8

Article

LLM Inference at scale with TGI

martinigoyanes • Sep 6, 2024 • 26

upvoted 4 articles almost 2 years ago

Article

Outperforming Claude 3.5 Sonnet with Phi-3-mini-4k for graph entity relationship extraction tasks

rcaulk • Aug 19, 2024 • 7

Article

Tensor Parallelism

huseinzol05 • Aug 20, 2024 • 21

Article

Tokenization Is A Dead Weight (Tokun Part 1)

apehex • Jun 27, 2024 • 18

Article

Introduction to State Space Models (SSM)

lbourdois • Jul 19, 2024 • 225

upvoted a paper about 2 years ago

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7, 2024 • 65

upvoted 2 papers over 2 years ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 252

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 56

upvoted a paper almost 3 years ago

Stay on topic with Classifier-Free Guidance

Paper • 2306.17806 • Published Jun 30, 2023 • 29



