Optimizing Large Language Models through Quantization: A Comparative Analysis of PTQ and QAT Techniques Paper • 2411.06084 • Published Nov 9, 2024
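For context on the contrast in the title: post-training quantization (PTQ) converts a finished model's weights to low precision in one shot, while quantization-aware training (QAT) simulates quantization inside the training graph so the weights adapt to the rounding error. Below is a minimal PyTorch sketch of both, not taken from the paper; per-tensor symmetric int8 and a straight-through estimator are standard illustrative choices.

```python
import torch

def quantize_int8(w: torch.Tensor):
    """PTQ: map already-trained FP weights to int8 with a per-tensor scale."""
    scale = w.abs().max() / 127.0
    q = torch.clamp(torch.round(w / scale), -127, 127).to(torch.int8)
    return q, scale

class FakeQuant(torch.autograd.Function):
    """QAT: quantize-dequantize in the forward pass, pass gradients
    through unchanged (straight-through estimator)."""
    @staticmethod
    def forward(ctx, w):
        scale = w.abs().max() / 127.0
        return torch.clamp(torch.round(w / scale), -127, 127) * scale

    @staticmethod
    def backward(ctx, grad_out):
        return grad_out  # STE: identity gradient through the rounding step

w = torch.randn(4, 4, requires_grad=True)
q, s = quantize_int8(w.detach())  # PTQ: one-shot, after training
w_q = FakeQuant.apply(w)          # QAT: lives inside the training graph
w_q.sum().backward()              # gradients still reach the FP weights
```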
DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models Paper • 2504.09223 • Published Apr 12, 2025
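The title points at combining a weight decomposition with low-rank adapters inside QAT. As a rough illustration only (the class name, rank, 4-bit range, and the per-row magnitude/direction split are my assumptions, not the paper's recipe), here is a sketch where a frozen base weight gets a trainable low-rank update plus a learned magnitude, and the combined weight is fake-quantized with a straight-through estimator:

```python
import torch
import torch.nn as nn

class LowRankQATLinear(nn.Module):
    """Hypothetical sketch: freeze the pretrained weight, train only a
    low-rank update (A, B) and a per-row magnitude m, and fake-quantize
    the combined weight so training sees the quantization error."""
    def __init__(self, weight: torch.Tensor, rank: int = 8):
        super().__init__()
        out_f, in_f = weight.shape
        self.weight = nn.Parameter(weight, requires_grad=False)  # frozen base
        self.A = nn.Parameter(torch.zeros(rank, in_f))
        self.B = nn.Parameter(torch.randn(out_f, rank) * 0.01)
        self.m = nn.Parameter(weight.norm(dim=1, keepdim=True))  # magnitude

    def forward(self, x):
        w = self.weight + self.B @ self.A              # low-rank adapted weight
        w = self.m * w / w.norm(dim=1, keepdim=True)   # re-apply learned magnitude
        scale = w.abs().max() / 7.0                    # 4-bit symmetric range
        w_q = torch.clamp(torch.round(w / scale), -7, 7) * scale
        w_q = w + (w_q - w).detach()                   # straight-through estimator
        return x @ w_q.t()

layer = LowRankQATLinear(torch.randn(16, 32), rank=4)
out = layer(torch.randn(2, 32))
out.sum().backward()  # only A, B, and m receive gradients
```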
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published Apr 15, 2025
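The "dynamic-length float" idea rests on BF16 weights being losslessly compressible: the 8 exponent bits carry far less than 8 bits of entropy, so they can be entropy-coded while sign and mantissa stay verbatim. A back-of-the-envelope sketch, assuming a Huffman code over the exponent field (the layout constants are standard BF16; everything else is illustrative, not the paper's GPU decoding kernel):

```python
import heapq
from collections import Counter
import torch

def huffman_lengths(freqs: Counter) -> dict:
    """Return the Huffman code length of each symbol in `freqs`."""
    heap = [(f, i, [s]) for i, (s, f) in enumerate(freqs.items())]
    heapq.heapify(heap)
    lengths = {s: 0 for s in freqs}
    uid = len(heap)  # unique tiebreaker so tuples never compare lists
    while len(heap) > 1:
        f1, _, s1 = heapq.heappop(heap)
        f2, _, s2 = heapq.heappop(heap)
        for s in s1 + s2:
            lengths[s] += 1  # symbols in a merged node sit one level deeper
        heapq.heappush(heap, (f1 + f2, uid, s1 + s2))
        uid += 1
    return lengths

# BF16 layout: 1 sign bit | 8 exponent bits | 7 mantissa bits.
w = torch.randn(100_000, dtype=torch.bfloat16)
bits = w.view(torch.uint16).int()
exponents = ((bits >> 7) & 0xFF).tolist()  # the low-entropy field

lengths = huffman_lengths(Counter(exponents))
coded_exp_bits = sum(lengths[e] for e in exponents)
total_bits = coded_exp_bits + 8 * w.numel()  # sign + mantissa kept verbatim
# Ignores the small code-table overhead.
print(f"compressed size: {total_bits / (16 * w.numel()):.1%} of BF16")
```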
Agent models: Internalizing Chain-of-Action Generation into Reasoning models Paper • 2503.06580 • Published Mar 9, 2025