metadata
license: mit
library_name: transformers
pipeline_tag: text-generation
This is the model used in paper, M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models.
Code: https://github.com/jxiw/M1
@article{wang2025m1scalabletesttimecompute,
title={M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models},
author={Junxiong Wang and Wen-Ding Li and Daniele Paliotta and Daniel Ritter and Alexander M. Rush and Tri Dao},
journal={arXiv preprint arXiv:2504.10449},
year={2025},
url={https://arxiv.org/abs/2504.10449},
}