Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 18 days ago • 104
DarwinLM: Evolutionary Structured Pruning of Large Language Models Paper • 2502.07780 • Published Feb 11 • 18
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Paper • 2501.02625 • Published Jan 5 • 16
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper • 2502.05003 • Published Feb 7 • 44
Sparse Finetuning for Inference Acceleration of Large Language Models Paper • 2310.06927 • Published Oct 10, 2023 • 14
Towards End-to-end 4-Bit Inference on Generative Large Language Models Paper • 2310.09259 • Published Oct 13, 2023 • 1
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression Paper • 2306.03078 • Published Jun 5, 2023 • 3
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation Paper • 2401.04679 • Published Jan 9, 2024 • 2
Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11, 2024 • 13
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization Paper • 2308.02060 • Published Aug 3, 2023 • 1
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization Paper • 2404.03605 • Published Apr 4, 2024 • 1
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning Paper • 2208.11580 • Published Aug 24, 2022