l3lab's Collections
L1

updated Jul 13

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

  • l3lab/L1-Qwen-7B-Max

    8B • Updated Jul 13 • 635

  • l3lab/L1-Qwen3-8B-Max

    8B • Updated Jul 13 • 55

  • l3lab/L1-Qwen-7B-Exact

    8B • Updated Jul 13 • 54

  • l3lab/L1-Qwen3-8B-Exact

    8B • Updated Jul 13 • 8

  • l3lab/L1-Qwen-1.5B-Exact

    2B • Updated Apr 7 • 3.33k • 6

  • l3lab/L1-1.5B-Short

    2B • Updated Jul 12 • 5

  • l3lab/L1-Qwen-1.5B-Max

    2B • Updated Mar 7 • 1.86k • 16