@kaikaidai on Hugging Face: "📈 Early results on the 8B evaluation model we've been training... @NinaCalvi…"

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

kaikaidai

posted an update Dec 4, 2024

Post

1112

📈 Early results on the 8B evaluation model we've been training...

@NinaCalvi wrote about the progress we've made this quarter towards training the best 'LLM-as-a-judge' evaluator. We've significantly improved against the baseline and are approaching state-of-the-art evaluation performance with an 8B model.

Next up: training Llama-3.1-70B 👀

Here's the full article: https://www.atla-ai.com/post/evaluating-the-evaluator

Taylor658

Dec 4, 2024

Nice Article! Does Atla-1-mini or its eval framework natively support function calling?

tobydrane

Dec 5, 2024

Hi @Taylor658 , the base model does natively support function calling. We have been running some internal tests of our models capability for function calling as this is something we might look to expose via our API in the near future.

In this post

kaikaidai kyle
Taylor658 atayloraerospace
tobydrane Toby Drane