Le Huy Hoang's picture

Le Huy Hoang

splendor1811

AI & ML interests

Computer Vision

Recent Activity

updated a Space 2 months ago
splendor1811/AlfredAgent
published a Space 2 months ago
splendor1811/AlfredAgent
View all activity

Organizations

None yet

splendor1811's activity

upvoted an article 17 days ago
view article
Article

Vision Language Models (Better, Faster, Stronger)

By merve and 4 others โ€ข
โ€ข 388
updated a Space 2 months ago
published a Space 2 months ago
updated a Space 4 months ago
upvoted 2 articles 4 months ago
view article
Article

Open-source DeepResearch โ€“ Freeing our search agents

By m-ric and 4 others โ€ข
โ€ข 1.25k
view article
Article

SmolVLM - small yet mighty Vision Language Model

By andito and 4 others โ€ข
โ€ข 296
reacted to sometimesanotion's post with ๐Ÿš€ 4 months ago
view post
Post
3339
**Update** Either I had some wrong numbers plugged in to estimate benchmark numbers from comparator, or the benchmark changed. Virtuoso Small v2 at 41.07 average is still very impressive, especially for writing draft copy for business purposes, while Lamarck remains a chatty generalist-reasoning model.

I've felt confident that 14B Qwen finetunes and merges could break the 42.0 average, and Arcee **came close** with https://huggingface.co/arcee-ai/Virtuoso-Small-2. Congratulations to @arcee-ai !

Just two months ago, it was easy to think that 14B had plateaued, that you could have high IFEVAL or high MUSR/MATH/GPQA at 14B, but not both. That barrier is completely shattered. I see a pathway to even better, and Virtuoso Small 2 is a big part of why. Very impressive work. This community would expect no less from Arcee.

Just look at this graph! Keep in mind, my merges here build on the first Virtuoso Small, and *-DS merges build on DeepSeek R1. There are some impressive merges in the pipe!
ยท
upvoted an article about 1 year ago