sometimesanotion

AI & ML interests

Agentic LLM services, model merging, finetunes, distillation

Organizations

Hugging Face Discord Community

sometimesanotion's activity

reacted to sequelbox's post with šŸ”„ about 5 hours ago

EARLY SNEAK PREVIEW: get a first look at the Celestia 3 science-reasoning dataset, built with DeepSeek's newest R1-0528 reasoning model! Subjects include physics, chemistry, biology, computer science, Earth science, astronomy, and information theory.

This early look contains the first 14k rows, all synthetic responses generated with deepseek-ai/DeepSeek-R1-0528.

SEE IT HERE: sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW
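
If you want to poke at the preview locally, here's a minimal sketch, assuming the repo loads as a standard Hugging Face dataset with a "train" split (column names will vary, so inspect the features first):

```python
# Minimal sketch: pull the preview and look at a row.
# Assumes a standard Hugging Face dataset layout with a "train" split.
from datasets import load_dataset

ds = load_dataset("sequelbox/Celestia3-DeepSeek-R1-0528-PREVIEW", split="train")
print(ds.features)  # inspect the column layout
print(ds[0])        # first synthetic prompt/response pair
```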

Support our releases: sequelbox/SupportOpenSource

Coming up, we'll have more dataset releases, including some novel reasoning and analysis methods. We think an important role for open source researchers is experimenting with new response styles on top of the increasingly excellent base models available to finetune.

more to come soon!
replied to CultriX's post 2 days ago

Now imagine this as a hashtag generator, so a RAG search can find great context. :)
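
A rough sketch of what I mean, assuming an OpenAI-compatible local endpoint; the endpoint URL, model name, and the tiny inverted index are all placeholders for illustration:

```python
# Sketch: tag chunks with LLM-generated hashtags so retrieval can filter on them.
# The endpoint URL and model name are placeholders; adapt to your local server.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed")
MODEL = "my-local-model"  # placeholder model name

def generate_hashtags(text: str, n: int = 5) -> list[str]:
    """Ask the model for up to n short hashtags describing the text."""
    resp = client.chat.completions.create(
        model=MODEL,
        messages=[
            {"role": "system", "content": "Reply with comma-separated hashtags only."},
            {"role": "user", "content": f"Give {n} hashtags for:\n\n{text}"},
        ],
    )
    raw = resp.choices[0].message.content
    return [t.strip().lstrip("#").lower() for t in raw.split(",") if t.strip()]

# Tiny inverted index: hashtag -> chunk ids, standing in for vector-store metadata.
index: dict[str, set[int]] = {}

def ingest(chunk_id: int, text: str) -> None:
    for tag in generate_hashtags(text):
        index.setdefault(tag, set()).add(chunk_id)

def candidate_chunks(query: str) -> set[int]:
    """Chunks sharing any hashtag with the query; feed these to the RAG retriever."""
    hits: set[int] = set()
    for tag in generate_hashtags(query):
        hits |= index.get(tag, set())
    return hits
```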

replied to CultriX's post 2 days ago

Neat! I've transitioned from wanting more from a model's one-shot answers to breaking things down and walking through the problem with cached context. This effectively means simulating most of the thinking block, but through tool usage and RAG.

I'm happily using our models from months ago to do it. If anything, even Lamarck 0.7's use of thinking blocks is a bit much. I'm using Lamarck 0.7 Fusion (my best GPQA model, though it didn't break your record, and it's best used where modest IFEval isn't a blocker) and /nothink with ValiantLab's Qwen3 models in concert.
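
A sketch of the shape of that toolchain: a stronger model plans the steps, a fast Qwen3 worker answers each step over retrieved cached context with thinking suppressed, and the planner stitches the results together. The endpoint, model names, and the `retrieve` stub are placeholders, and check your server's exact spelling of the no-think switch:

```python
# Sketch: simulate a "thinking block" with explicit steps plus cached RAG context
# instead of one long one-shot generation. Names and endpoint are placeholders.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed")
PLANNER = "lamarck-0.7-fusion"   # placeholder: stronger reasoning model
WORKER = "qwen3-14b"             # placeholder: fast Qwen3 model, thinking suppressed

def ask(model: str, prompt: str, no_think: bool = False) -> str:
    if no_think:
        prompt += " /no_think"   # Qwen3 soft switch; spelling may differ in your setup
    resp = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return resp.choices[0].message.content

def retrieve(step: str) -> str:
    """Placeholder for a local RAG lookup over cached context."""
    return ""  # plug in your retriever here

def walk_through(question: str) -> str:
    plan = ask(PLANNER, f"List the sub-steps needed to answer:\n{question}")
    notes = []
    for step in [s for s in plan.splitlines() if s.strip()]:
        context = retrieve(step)
        notes.append(ask(WORKER, f"Context:\n{context}\n\nDo this step:\n{step}", no_think=True))
    return ask(PLANNER, f"Question: {question}\n\nStep results:\n" + "\n".join(notes))
```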

I suspect I'll try some merges soon to give this toolchain better models, leaderboard or no leaderboard!

replied to sequelbox's post 7 days ago

I've been using Esper3 8B and 14B for first-pass code review. I am quite pleased.
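
The harness for that is tiny; here's a sketch, assuming an OpenAI-compatible server hosting an Esper3 build (the endpoint and model name are placeholders):

```python
# Sketch: first-pass review of a staged git diff with a locally served model.
# Endpoint and model name are placeholders for however you serve Esper3.
import subprocess
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="not-needed")
MODEL = "esper3-14b"  # placeholder

def review_staged_changes() -> str:
    diff = subprocess.run(
        ["git", "diff", "--staged"], capture_output=True, text=True, check=True
    ).stdout
    if not diff:
        return "Nothing staged to review."
    resp = client.chat.completions.create(
        model=MODEL,
        messages=[
            {"role": "system",
             "content": "You are a code reviewer. Flag bugs, risky changes, and missing tests."},
            {"role": "user", "content": f"Review this diff:\n\n{diff}"},
        ],
    )
    return resp.choices[0].message.content

if __name__ == "__main__":
    print(review_staged_changes())
```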

Have you considered fine-tuning a 1.7B or smaller model for autocomplete?

replied to CultriX's post 7 days ago

I've been thinking a lot about using small caches of embeddings for local RAG lately. Have you considered an HTTP caching proxy like Squid as a low-impact source? It would retrieve what a user is reading anyway, and what's in their field of interest. A browser extension to signal some limited ingestion when a page is bookmarked might fit a lot of use cases.
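
For the bookmark route, here's roughly what I have in mind: a tiny local endpoint the extension POSTs a URL to, which fetches, chunks, and embeds the page. The endpoint, embedding model, and flat-file store are placeholder choices:

```python
# Sketch: a small local ingestion endpoint a browser extension could POST
# bookmarked URLs to. Embedding model and storage are placeholder choices.
import json
import requests
from flask import Flask, request
from sentence_transformers import SentenceTransformer

app = Flask(__name__)
embedder = SentenceTransformer("all-MiniLM-L6-v2")  # small local embedding model
STORE = "bookmark_embeddings.jsonl"                 # flat file standing in for a vector DB

def chunk(text: str, size: int = 800) -> list[str]:
    return [text[i:i + size] for i in range(0, len(text), size)]

@app.post("/ingest")
def ingest():
    url = request.json["url"]
    page = requests.get(url, timeout=10).text      # raw HTML; strip tags in a real setup
    with open(STORE, "a") as f:
        for piece in chunk(page):
            vec = embedder.encode(piece).tolist()
            f.write(json.dumps({"url": url, "text": piece, "embedding": vec}) + "\n")
    return {"status": "ok"}

if __name__ == "__main__":
    app.run(port=5005)
```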

For many reasons, smart management of context windows is my top priority with AI now!

upvoted an article 15 days ago