SAMBIT CHAKRABORTY
sambitchakhf03
AI & ML interests
None yet
Recent Activity
upvoted a paper about 8 hours ago
SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference
upvoted a paper about 9 hours ago
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
upvoted a paper about 14 hours ago
Slamming: Training a Speech Language Model on One GPU in a Day
Organizations
Collections
5
- Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU
  Paper • 2403.06504 • Published • 53
- Stealing Part of a Production Language Model
  Paper • 2403.06634 • Published • 91
- MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression
  Paper • 2406.14909 • Published • 15
models
2
datasets
None public yet