
OctoThinker/OctoThinker-8B-Hybrid-Base
Updated
ā¢
12
ā¢
2
None defined yet.
š OctoThinker is led by GAIR
šÆ Our Goal: To reshape the pre-training trajectory so models scale better under RL.