t.d.a.g. PRO

sequelbox

AI & ML interests

open source, infinite games. (they/them)

Organizations

Valiant Labs

sequelbox's activity

New activity in sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW about 2 hours ago
posted an update 1 day ago
EARLY SNEAK PREVIEW: get a first look at the Celestia 3 science-reasoning dataset, built with DeepSeek's newest R1-0528 reasoning model! Subjects include physics, chemistry, biology, computer science, Earth science, astronomy, and information theory.

This early look contains the first 14k rows, all synthetic responses generated with deepseek-ai/DeepSeek-R1-0528.

SEE IT HERE: sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW

Support our releases: sequelbox/SupportOpenSource

Coming up we'll have more dataset releases, including some novel reasoning and analysis methods - we think an important role for open source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.

more to come soon!
allegra
posted an update 6 days ago
NEW RELEASE: we've brought Esper 3 to the new deepseek-ai/DeepSeek-R1-0528-Qwen3-8B model!

- A full-stack software assistant: a reasoning finetune focused on coding, architecture, and DevOps using the Titanium and Tachibana datasets!
- Improved general and creative reasoning skills, powered by the Raiden dataset.

Get the newest Esper 3: ValiantLabs/DeepSeek-R1-0528-Qwen3-8B-Esper3
Support our releases: sequelbox/SupportOpenSource

more on the way next week!

celestially yours ;)
allegra
replied to their post 7 days ago

we'll be expanding Qwen sizes in both directions :) thanks for your review!

posted an update 8 days ago
Updates for the week:
- released some new merge models using ValiantLabs/Qwen3-14B-Esper3 and other Qwen 3 14B finetunes - these merges include math, Web3, uncensored, and general mix. depending on your use case for Esper 3, these may be helpful to you! find them at @sequelbox
- coming up we'll have more model sizes for Esper 3 and Cobalt 2, releasing soon!
- also super excited for more dataset releases with the newly released deepseek-ai/DeepSeek-R1-0528

Support the above efforts and others: sequelbox/SupportOpenSource

back to building :)
reacted to lukmanaj's post with 👍 8 days ago
I am so happy to share with everyone that I’ve just completed the first unit of the new MCP course on Hugging Face and earned my certificate! The AI acceleration track is intense and fast-paced, but I’m doing my best to keep up. Excited for what’s ahead!