HelpingAI

company

Verified

https://helpingai.co/

helping_ai

HelpingAI

helping-ai

Activity Feed

AI & ML interests

Helping AI to become AGI

Recent Activity

Abhaykoul updated a Space about 2 hours ago

HelpingAI/Dhanishtha-2.0-preview

KingNish updated a model about 1 month ago

HelpingAI/Dhanishtha-2.0-0126

KingNish published a model about 1 month ago

HelpingAI/Dhanishtha-2.0-0126

View all activity

Abhaykoul

updated a Space about 2 hours ago

Dhanishtha 2.0 Preview

🏆

Chat with an AI that shows its reasoning steps

KingNish

updated a model about 1 month ago

HelpingAI/Dhanishtha-2.0-0126

Text Generation • 15B • Updated Feb 7 • 9 • 2

KingNish

published a model about 1 month ago

HelpingAI/Dhanishtha-2.0-0126

Text Generation • 15B • Updated Feb 7 • 9 • 2

Abhaykoul

updated a collection about 2 months ago

Dhanishtha model Checkpoints

Collection

Our Reasoning models • 8 items • Updated 17 days ago • 4

KingNish

posted an update 3 months ago

Post

3240

Muon vs MuonClip vs Muon+Adamw

Muon has gone from an experiment to a mainstream optimizer, but does it hold up for fine‑tuning? We ran head‑to‑head tests on Qwen3‑4B (10k+ high‑quality instruction rows) to find out.

Short story: Pure Muon converged fastest at the start, but its gradient‑norm spikes made training unstable. MuonClip (Kimi K2’s clipping) stabilizes long pretraining runs, yet in our small‑scale fine‑tune it underperformed, lower token accuracy and slower convergence. The winner was the hybrid: Muon for 2D layers + AdamW for 1D layers. It delivered the best balance of stability and final performance and even beat vanilla AdamW.

Takeaway: for small-scale fine-tuning, hybrid = practical and reliable.

Next Step: scale to larger models/datasets to see if Muon’s spikes become catastrophic or if clipping wins out.

Full Blog Link: https://huggingface.co/blog/KingNish/optimizer-part1

KingNish

posted an update 3 months ago

Post

2713

I tested Muon vs MuonClip vs Muon+AdamW for fine-tuning LLMs
Just published a blog on that, Read here 👉 https://huggingface.co/blog/KingNish/optimizer-part1