FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference (arXiv:2502.20766, published Feb 28, 2025)