EARLY SNEAK PREVIEW: get a first look at the Celestia 3 science-reasoning dataset, built with DeepSeek's newest R1-0528 reasoning model! Subjects include physics, chemistry, biology, computer science, Earth science, astronomy, and information theory.
Coming up, we'll have more dataset releases, including some novel reasoning and analysis methods. We think an important role for open-source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.
- A full-stack software assistant: a reasoning finetune focused on coding, architecture, and DevOps, using the Titanium and Tachibana datasets!
- Improved general and creative reasoning skills, powered by the Raiden dataset.
Updates for the week:
- Released some new merge models using ValiantLabs/Qwen3-14B-Esper3 and other Qwen3-14B finetunes. These merges include math, Web3, uncensored, and general-purpose mixes; depending on your use case for Esper 3, they may be helpful to you! Find them at @sequelbox.
- Coming up: more model sizes for Esper 3 and Cobalt 2, releasing soon!
- Also super excited for more dataset releases built with the newly released deepseek-ai/DeepSeek-R1-0528!
I am so happy to share with you all that I've just completed the first unit of the new MCP course on Hugging Face and earned my certificate! The AI acceleration track is intense and fast-paced, but I'm doing my best to keep up. Excited for what's ahead!