Elsayed Mohamed's picture

Elsayed Mohamed

sayedM

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago
OmarSamir/EGTTS-V0.1
liked a Space about 8 hours ago
deepseek-ai/Janus-Pro-7B
liked a Space 2 days ago
m42-health/MEDIC-Benchmark
View all activity

Organizations

Pxivision's profile picture

sayedM's activity

liked a Space 2 days ago
upvoted an article 2 days ago
view article
Article

We now support VLMs in smolagents!

49
reacted to m-ric's post with 🔥 2 days ago
view post
Post
2196
Today we make the biggest release in smolagents so far: 𝘄𝗲 𝗲𝗻𝗮𝗯𝗹𝗲 𝘃𝗶𝘀𝗶𝗼𝗻 𝗺𝗼𝗱𝗲𝗹𝘀, 𝘄𝗵𝗶𝗰𝗵 𝗮𝗹𝗹𝗼𝘄𝘀 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗽𝗼𝘄𝗲𝗿𝗳𝘂𝗹 𝘄𝗲𝗯 𝗯𝗿𝗼𝘄𝘀𝗶𝗻𝗴 𝗮𝗴𝗲𝗻𝘁𝘀! 🥳

Our agents can now casually open up a web browser, and navigate on it by scrolling, clicking elements on the webpage, going back, just like a user would.

The demo below shows Claude-3.5-Sonnet browsing GitHub for task: "Find how many commits the author of the current top trending repo did over last year."
Hi @mlabonne !

Go try it out, it's the most cracked agentic stuff I've seen in a while 🤯 (well, along with OpenAI's Operator who beat us by one day)

For more detail, read our announcement blog 👉 https://huggingface.co/blog/smolagents-can-see
The code for the web browser example is here 👉 https://github.com/huggingface/smolagents/blob/main/examples/vlm_web_browser.py
·
upvoted an article 2 days ago
view article
Article

Introducing smolagents: simple agents that write actions in code.

530
reacted to alibabasglab's post with 👍 6 days ago
updated a Space about 1 month ago
liked a Space about 1 month ago