Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
onekqΒ 
posted an update 2 days ago
Post
287
Announce πŸŽ‰ WebApp1K-Duo πŸŽ‰
onekq-ai/WebApp1K-Duo-React

This is to keep up the challenge after OpenAI o1 models saturated the WebApp1K benchmark. The new benchmark brings SOTA to 67%. Let the hill climbing commence!
onekq-ai/WebApp1K-models-leaderboard

PS: I will publish more findings soon.
In this post