LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Paper • 2502.14866 • Published Feb 20 • 13
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 9