OpenThoughts

non-profit

https://openthoughts.ai

AI & ML interests

Open Reasoning Datasets

Recent Activity

penfever authored a paper 12 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

RZ412 authored a paper 14 days ago

OpenThoughts-Agent: Data Recipes for Agentic Models

penfever updated a dataset 25 days ago

open-thoughts/TaskTrove

View all activity

Organization Card

Community About org cards

https://openthoughts.ai

A community effort to curate the best open post-training datasets.

We are currently working on OpenThoughts-Agent, a collaboration building the best open agent training datasets.

Our first project was curating open reasoning data recipes. OpenThoughts3, our best reasoning dataset recipe, is detailed in our release blog and the full paper.

About us

We are a team of researchers and engineers from Bespoke Labs, Stanford, University of California Berkeley, University of Washington, UT Austin, Juelich Supercomputing Center (JSC), LAION, UCLA, UNC Chapel Hill, and Toyota Research Institute united around building the best datasets (and thus the best models). See our previous works at datacomp.ai and mlfoundations.

Open Thoughts is supported by Bespoke Labs, Lambda Labs, NSF IFML, Juelich Supercomputing Center, UT Austin Machine Learning Lab, Toyota Research Institute.

Collections 6

View 6 collections

models 17

open-thoughts/OpenThinkerAgent-8B-ColdStartSFTForRL

Text Generation • 308k • Updated about 1 month ago • 1.85k • 1

open-thoughts/OpenThinkerAgent-8B-RL

Text Generation • 8B • Updated about 1 month ago • 72 • 2

open-thoughts/OpenThinkerAgent-32B

Text Generation • 677k • Updated Jun 8 • 6.4k • • 8

open-thoughts/OpenThinkerAgent-32B-SFT-100K

Text Generation • 677k • Updated Jun 8 • 94

open-thoughts/OpenThinkerAgent-32B-SFT-31.6K

Text Generation • 677k • Updated Jun 8 • 7

open-thoughts/OpenThinkerAgent-32B-SFT-10K

Text Generation • 677k • Updated Jun 8 • 5

open-thoughts/OpenThinkerAgent-32B-SFT-3.16K

Text Generation • 677k • Updated Jun 8 • 8

open-thoughts/OpenThinkerAgent-32B-SFT-1K

Text Generation • 677k • Updated Jun 8 • 7

open-thoughts/OpenThinkerAgent-32B-SFT-316

Text Generation • 677k • Updated Jun 8 • 12

open-thoughts/OpenThinker-Agent-v1-SFT

Text Generation • 308k • Updated Jan 27 • 650 • • 9

datasets 19

open-thoughts/TaskTrove

Viewer • Updated 25 days ago • 17.2k • 1.67k • 22

open-thoughts/OpenThoughts-Agent-SFT-ColdStartForRL-10K

Viewer • Updated about 1 month ago • 9.44k • 158 • 1

open-thoughts/OpenThoughts-Agent-RL-5K

Viewer • Updated about 1 month ago • 5k • 163 • 1

open-thoughts/OpenThoughts-Agent-SFT-100K

Viewer • Updated Jun 8 • 94.3k • 1.1k • 16

open-thoughts/OpenThoughts-Agent-SFT-31.6K

Viewer • Updated Jun 8 • 31.6k • 47

open-thoughts/OpenThoughts-Agent-SFT-10K

Viewer • Updated Jun 8 • 10k • 230

open-thoughts/OpenThoughts-Agent-SFT-3.16K

Viewer • Updated Jun 8 • 3.16k • 53

open-thoughts/OpenThoughts-Agent-SFT-1K

Viewer • Updated Jun 8 • 1k • 331 • 1

open-thoughts/OpenThoughts-Agent-SFT-316

Viewer • Updated Jun 8 • 316 • 43

open-thoughts/AgentTrove

Viewer • Updated May 7 • 1.7M • 2.5k • 188

View 19 datasets