EmbRACE-3K: Embodied Reasoning and Action in Complex Environments Paper • 2507.10548 • Published Jul 14 • 36
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning Paper • 2506.08889 • Published Jun 10 • 24
SeerAttention/SeerAttention-Decode-R1-Distill-Qwen-14B-AttnGates Text Generation • Updated Jun 9 • 165
SeerAttention/SeerAttention-Decode-R1-Distill-Qwen-14B-AttnGates Text Generation • Updated Jun 9 • 165
SeerAttention/SeerAttention-DeepSeek-R1-Distill-Qwen-32B-AttnGates Text Generation • Updated Mar 3 • 9 • 1
SeerAttention/SeerAttention-DeepSeek-R1-Distill-Qwen-32B-AttnGates Text Generation • Updated Mar 3 • 9 • 1