Replaced get_max_length with get_max_cache_shape
#12 opened 2 months ago by MHGN
The model only contains mamba and no attention
#11 opened 6 months ago by AscendingGrass
Nvidia ChatRTX
#8 opened 8 months ago by monostronomy