Prashant Dandriyal

pedrostu
AI & ML interests

None yet

Recent Activity

Organizations

None yet

pedrostu's activity

reacted to m-ric's post with 😔 about 2 months ago
"2025 will be the year of AI agents": this statement has often been made; here are numbers to support it.

I've plotted the progress of AI agents on the GAIA test set, and it seems they're headed to catch up with the human baseline in early 2026.

And that progress is still driven mostly by the improvement of base LLMs: progress would be even faster with fine-tuned agentic models.
upvoted an article 3 months ago

Introducing the Open Chain of Thought Leaderboard

• 31