Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10.0
TFLOPS
6
2
2
kyle
PRO
kaikaidai
Follow
bergr7f's profile picture
inwaves's profile picture
HennersBro98's profile picture
4 followers
ยท
15 following
kaikaidai
AI & ML interests
None yet
Recent Activity
updated
a Space
7 days ago
AtlaAI/judge-arena
new
activity
10 days ago
AtlaAI/judge-arena:
Promotion to get more voters
posted
an
update
22 days ago
๐ Early results on the 8B evaluation model we've been training... @NinaCalvi wrote about the progress we've made this quarter towards training the best 'LLM-as-a-judge' evaluator. We've significantly improved against the baseline and are approaching state-of-the-art evaluation performance with an 8B model. Next up: training Llama-3.1-70B ๐ Here's the full article: https://www.atla-ai.com/post/evaluating-the-evaluator
View all activity
Articles
Judge Arena: Benchmarking LLMs as Evaluators
Nov 19
โข
53
Experimenting with different training objectives for an AI evaluator
Oct 31
โข
2
Organizations
kaikaidai
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
about 1 month ago
Running
on
CPU Upgrade
544
๐
TTS Arena
Vote on the latest TTS models!
liked
a Space
about 2 months ago
Running
83
๐ป
Judge Arena