Open Thoughts

non-profit
Activity Feed

AI & ML interests

Open Reasoning Datasets

Recent Activity

ryanmarten  updated a dataset about 7 hours ago
open-thoughts/OpenThoughts2-1M
sedrickkeh  updated a model about 11 hours ago
open-thoughts/OpenThinker2-32B
sedrickkeh  updated a model about 12 hours ago
open-thoughts/OpenThinker2-7B
View all activity

https://open-thoughts.ai

Curating the best open reasoning datasets. A Bespoke Labs and DataComp community effort.

Our first goal is to curate a reasoning dataset to train state of the art small reasoning models that surpass DeepSeek-R1-Distill-32B and DeepSeek-R1-Distill-7B on math and code reasoning benchmarks.

About us

We are a team of researchers and engineers from Bespoke Labs, Stanford, University of California Berkeley, University of Washington, UT Austin, Juelich Supercomputing Center (JSC), LAION, UCLA, UNC Chapel Hill, and Toyota Research Institute united around building the best datasets (and thus the best models). See our previous works at datacomp.ai and mlfoundations.

Open Thoughts is supported by Bespoke Labs, Lambda Labs, NSF IFML, Juelich Supercomputing Center, UT Austin Machine Learning Lab, Toyota Research Institute.