skysight-inc (Skysight)

sethkimmel3

updated a dataset 3 months ago

skysight-inc/synthetic-humans-1m

Viewer • Updated Apr 26 • 901k • 73 • 1

sea-snell

authored a paper 3 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 43

sethkimmel3

posted an update 3 months ago

We created a dataset of 1 million MIT-licensed synthetic humans, sampled from actual US demographics. You can use it to seed LLM synthetic data generation, so that each generated output is conditioned on a different, statistically realistic persona and the overall set stays diverse.

Dataset: skysight-inc/synthetic-humans-1m

Accompanying blog post with methodology: https://www.skysight.inc/blog/synthetic-humans
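
As a rough illustration of what "seeding" generation with personas can look like, here is a minimal sketch. It does not reproduce the methodology from the blog post. The persona fields (`age`, `occupation`, `state`), the `build_prompt` and `seeded_prompts` helpers, and the example task are all hypothetical, and the real dataset's columns may differ.

```python
import random

# Hypothetical persona records standing in for rows of the dataset.
# The actual column names in skysight-inc/synthetic-humans-1m may differ.
PERSONAS = [
    {"age": 34, "occupation": "nurse", "state": "Ohio"},
    {"age": 61, "occupation": "retired teacher", "state": "Texas"},
    {"age": 22, "occupation": "line cook", "state": "Oregon"},
]


def build_prompt(persona, task):
    """Prefix a generation task with a one-line persona description."""
    description = ", ".join(f"{key}: {value}" for key, value in persona.items())
    return f"You are writing as this person ({description}).\n{task}"


def seeded_prompts(personas, task, n, seed=0):
    """Sample n distinct personas reproducibly and build one prompt per persona."""
    rng = random.Random(seed)  # local RNG so the global random state is untouched
    sample = rng.sample(personas, n)  # sampling without replacement
    return [build_prompt(persona, task) for persona in sample]


prompts = seeded_prompts(
    PERSONAS, "Write a short review of a local coffee shop.", n=2
)
for prompt in prompts:
    print(prompt)
    print("---")
```

Each prompt would then be sent to whichever LLM you use. To work with the real data instead of the placeholder list, one option is `load_dataset("skysight-inc/synthetic-humans-1m")` from the Hugging Face `datasets` library, assuming the repository loads with its default configuration.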