
Amir Mahla

A-Mahla

AI & ML interests

None yet

Recent Activity

Organizations

Hugging Face, H company, smolagents, GeekAgents

A-Mahla's activity

upvoted an article 9 days ago

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

• 38
published an article 9 days ago

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

• 38
upvoted an article 12 days ago

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

By Hcompany and 1 other • 65
upvoted an article 30 days ago
reacted to merve's post with 🔥 2 months ago
4490
sooo many open AI releases past week, let's summarize! 🤗
merve/april-11-releases-67fcd78be33d241c0977b9d2

multimodal
> Moonshot AI released Kimi VL Thinking, first working open-source multimodal reasoning model and Kimi VL Instruct, both 16B MoEs with 3B active params (OS)
> InternVL3 released based on Qwen2.5VL, 7 ckpts with various sizes (1B to 78B)

LLMs
> NVIDIA released Llama-3_1-Nemotron-Ultra-253B-v1 an LLM built on Llama 405B for reasoning, chat and tool use
> Agentica released DeepCoder-14B-Preview, fine-tuned version of DeepSeek-R1-Distilled-Qwen-14B on problem-test pairs, along with the compiled dataset
> Zyphra/ZR1-1.5B is a new small reasoning LLM built on R1-Distill-1.5B (OS)
> Skywork-OR1-32B-Preview is a new reasoning model by Skywork

Image Generation
> HiDream releases three new models, HiDream I1 Dev, I1 Full, and I1 fast for image generation (OS)

*OS ones have Apache 2.0 or MIT licenses