Lucas P.
CosmossG
1 follower · 30 following
AI & ML interests
None yet
Recent Activity
Reacted to fdaudens's post with 🔥 (4 days ago):
You might not have heard of Moonshot AI, but within 24 hours, their new model Kimi K2 shot to the top of Hugging Face’s trending leaderboard. So… who are they, and why does it matter?

Had a lot of fun co-writing this blog post with @xianbao, with key insights translated from Chinese, to unpack how this startup built a model that outperforms GPT-4.1, Claude Opus, and DeepSeek V3 on several major benchmarks.

🧵 A few standout facts:

1. From zero to $3.3B in 18 months: Founded in March 2023, Moonshot is now backed by Alibaba, Tencent, Meituan, and HongShan.
2. A CEO who thinks from the end: Yang Zhilin (31) previously worked at Meta AI, Google Brain, and Carnegie Mellon. His vision? Nothing less than AGI, still a rare ambition among Chinese AI labs.
3. A trillion-parameter model that’s surprisingly efficient: Kimi K2 uses a mixture-of-experts architecture (32B active params per inference) and dominates on coding/math benchmarks.
4. The secret weapon, the Muon optimizer: A new training method that doubles efficiency, cuts memory in half, and ran 15.5T tokens with zero failures. Big implications.

Most importantly, their move from closed to open source signals a broader shift in China’s AI scene, following Baidu’s pivot. But as Yang puts it: “Users are the only real leaderboard.”

👇 Check out the full post to explore what Kimi K2 can do, how to try it, and why it matters for the future of open-source LLMs: https://huggingface.co/blog/fdaudens/moonshot-ai-kimi-k2-explained
New activity in Fentible/Cthulhu-24B-v1 (4 days ago): Cydonia v4
New activity in Fentible/Cthulhu-24B-v1 (9 days ago): Great merge!
Organizations
None yet
CosmossG's activity
Liked 3 Spaces (about 1 year ago)
ChatGPT Prompt Generator 👨 · Runtime error · 1.22k likes
CLIP Interrogator 2 🕵 · Running on T4 · MCP · 1.3k likes · Generate image prompts using different modes
MusicGen 🎵 · Running on A10G · 5.03k likes · Generate music from text descriptions