Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
JunxiongWang 's Collections
M1
MambaInLlama_MATH_Reasoning
MambaInLlama-dpo
MambaInLlama-distill
Mamba2InLlama3.2-3B
Mamba-In-Zephyr
Mamba-In-Llama3
Mamba2-In-Llama3
MambaByte

MambaInLlama_MATH_Reasoning

updated about 9 hours ago

Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners. https://arxiv.org/pdf/2502.20339

Upvote
-

  • JunxiongWang/MambaInLlama1B_Distill_MATH

    1B • Updated Jan 23 • 80

  • JunxiongWang/MambaInLlama1B_SFT_MATH

    1B • Updated Feb 11 • 12

  • JunxiongWang/MambaInLlama3B_SFT_MATH

    3B • Updated Feb 7 • 41

  • JunxiongWang/MambaInLlama3B_Distill_MATH

    3B • Updated Jan 27 • 7
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs