-
SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement
Paper • 2504.03561 • Published • 18 -
Agent models: Internalizing Chain-of-Action Generation into Reasoning models
Paper • 2503.06580 • Published • 18 -
LLMs achieve adult human performance on higher-order theory of mind tasks
Paper • 2405.18870 • Published • 18
koskokos
koskokos
·
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 month ago
nari-labs/Dia-1.6B
reacted
to
sometimesanotion's
post
with 👍
about 1 month ago
The capabilities of the new Qwen 3 models are fascinating, and I am watching that space!
My experience, however, is that context management is vastly more important with them. If you use a client with a typical session log with rolling compression, a Qwen 3 model will start to generate the same messages over and over. I don't think that detracts from them. They're optimized for a more advanced MCP environment. I honestly think the 8B is optimal for home use, given proper RAG/CAG.
In typical session chats, Lamarck and Chocolatine are still my daily drives. I worked hard to give Lamarck v0.7 a sprinkling of CoT from both DRT and Deepseek R1. While those models got surpassed on the leaderboards, in practice, I still really enjoy their output.
My projects are focusing on application and context management, because that's where the payoff in improved quality is right now. But should there be a mix of finetunes to make just the right mix of - my recipes are standing by.
liked
a model
about 1 month ago
microsoft/Phi-4-reasoning-plus
Organizations
None yet
Collections
1
models
0
None public yet
datasets
0
None public yet