Activity Feed

AI & ML interests

Evaluating open LLMs

Recent Activity

open-llm-leaderboard's activity

AdinaYΒ 
posted an update 4 days ago
AdinaYΒ 
posted an update 5 days ago
view post
Post
3953
Kimi-Dev πŸ’» New coding model by Moonshot AI

moonshotai/Kimi-Dev-72B

✨ 72B - MIT license
✨ 60.4% on SWE-bench Verified
✨ RL-trained to patch real repos in Docker
✨ Only rewarded if full test suite passes
AdinaYΒ 
posted an update 5 days ago
view post
Post
604
MiniMax-M1 πŸ”₯ The First reasoning model by MiniMax.

MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

✨ 40k/80k thinking budget
✨ Powered by Hybrid MoE + Lightning Attention πŸ‘€
✨ 1M context length 🀯
✨ Apache 2.0
✨ RL-trained for math, coding & real-world software
AdinaYΒ 
posted an update 5 days ago
view post
Post
396
Hunyuan 3D 2.1 πŸ”₯ Industrial-grade 3D model just released by Tencent Hunyuan

tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1

✨ PBR materials: leather, bronze & more, breathtaking realism under any light
✨ Consumer GPU-ready: good for developers and small teams

AdinaYΒ 
posted an update 11 days ago
victorΒ 
posted an update 11 days ago
view post
Post
1977
Open Source Avengers, Assemble! Ask an expert AI agent team to solve complex problems together πŸ”₯

Consilium brings together multiple agents that debate and use live research (web, arXiv, SEC) to reach a consensus. You set the strategy, they find the answer.

Credit to @azettl for this awesome demo: Agents-MCP-Hackathon/consilium_mcp
  • 2 replies
Β·
freddyaboultonΒ 
posted an update 12 days ago
view post
Post
434
Time is running out! ⏰

Less than 24 hours to participate in the MCP Hackathon and win thousands of dollars in prizes! Don't miss this opportunity to showcase your skills.

Visit Agents-MCP-Hackathon/AI-Marketing-Content-Creator to register!

freddyaboultonΒ 
posted an update 12 days ago
view post
Post
335
🚨 NotebookLM Dethroned?! 🚨

Meet Fluxions vui: The new open-source dialogue generation model.
🀯 100M Params, 40k hours audio!
πŸŽ™οΈ Multi-speaker audio
πŸ˜‚ Non-speech sounds (like [laughs]!)
πŸ“œ MIT License

Is this the future of content creation? Watch the video and decide for yourself!

https://huggingface.co/spaces/fluxions/vui-spacehttps://huggingface.co/fluxions/vui
  • 1 reply
Β·
AdinaYΒ 
posted an update 12 days ago
view post
Post
3166
RoboBrain 2.0πŸ”₯ OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs
AdinaYΒ 
posted an update 14 days ago
view post
Post
2675
RedNote 小纒书 just released their first LLM πŸ”₯

dots.llm1.base πŸͺ a 142B MoE model with only 14B active params.

rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c
✨ Base & Instruct - MIT license
✨ Trained on 11.2T non-synthetic high-quality data
✨ Competitive with Qwen2.5/3 on reasoning, code, alignment
AdinaYΒ 
posted an update 14 days ago
view post
Post
445
MiniCPM4πŸ”₯ efficient LLMs built for end-side devices, by OpenBMB

openbmb/minicpm4-6841ab29d180257e940baa9b

✨ Apache 2.0
✨ 5–7Γ— Faster Inference (Jetson Orin & RTX 4090)
✨ 8B trained on 8T clean, non-synthetic tokens
✨ 32K Native Context -> 128K+ with InfLLM v2 + LongRoPE
✨ Runs on πŸ€—Transformers , http://CPM.cu, vLLM, and SGLang
AdinaYΒ 
posted an update 16 days ago
AdinaYΒ 
posted an update 16 days ago
view post
Post
1608
OpenAudio S1-mini πŸ”Š a new OPEN multilingual TTS model trained on 2M+ hours of data, by FishAudio

fishaudio/openaudio-s1-mini

✨ Supports 14 languages
✨ 50+ emotions & tones
✨ RLHF-optimized
✨ Special effects: laughing, crying, shouting, etc.
  • 1 reply
Β·
AdinaYΒ 
posted an update 17 days ago
AdinaYΒ 
posted an update 18 days ago
view post
Post
880
SynLogic 🧠 logical reasoning model & dataset by MiniMax.

MiniMaxAI/synlogic-6836c3246fca0277657ff032

✨ 3 models: 7B/32B/ Mix-3-32B (MIT license)
✨ Dataset: 35 verifiable logic tasks (Sudoku, Cipher, Arrow Maze etc.)
✨ RL training with auto-verifiable rewards
✨ Generalizes to math without explicit math training
✨ +6 pts on BBEH, +9.5 on KOR-Bench vs baselines
AdinaYΒ 
posted an update 18 days ago
view post
Post
1661
Video-XL-2 πŸ”₯ long video understanding model by BAAI & Shanghai Jiaotong University

BAAI/Video-XL-2

✨ Apache 2.0
✨ Handles up to 10,000+ frames on a single GPU
✨ 2048-frame encoding in just 12s
✨ Efficient Chunk-based Prefilling & Bi-granularity KV decoding
AdinaYΒ 
posted an update 19 days ago
view post
Post
2154
May highlights from China’s open source ecosystem πŸ”₯

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & 8B distralled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, Dream0...

✨ Multimodal is on fire!
- HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excels
- Alibaba Qwen: World PM 72B
- BAAI:RobotBrain (MLLM for robotic)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork:Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...
AdinaYΒ 
posted an update 22 days ago
view post
Post
537
MiMo-VL πŸ”₯ smol & mighty vision language model by Xiaomi

XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212

✨ 7B with RL & SFT
✨ Native resolution ViT for fine grained perception
✨ MORL = smarter alignment across perception, grounding & reasoning