Clémentine Fourrier's picture

Clémentine Fourrier

clefourrier

·

http://clefourrier.github.io

AI & ML interests

None yet

Recent Activity

updated a dataset about 4 hours ago

gaia-benchmark/results_public

updated a dataset about 23 hours ago

gaia-benchmark/results_public

updated a dataset 2 days ago

gaia-benchmark/results_public

View all activity

Organizations

liked a Space 4 months ago

Evaluation Guidebook

Explore LLM benchmark trends over time

liked 4 Spaces 5 months ago

Z Image Turbo

Generate images from text prompts with customizable resolution

Benchmark Finder

A space to view and inspect all the tasks in lighteval

InferenceProviderTestingBackend

Launch and monitor model evaluation jobs

Comparia Leaderboard

Leaderboard of the French government's chatbot arena

liked a Space 6 months ago

The Smol Training Playbook

The secrets to building world-class LLMs

liked a dataset 6 months ago

gaia-benchmark/GAIA

Viewer • Updated Oct 28, 2025 • 932 • 22.3k • 644

liked 4 Spaces 7 months ago

VideoEval-Pro Leaderboard

A more robust benchmark for long video understanding.

VibeGame

Vibe code 3D games

Guide sur l'évaluation des LLM

Traduction du guide de Clémentine Fourrier

Gaia2 Agents Evaluation Leaderboard

View and compare Gaia2 benchmark leaderboards for AI models

liked a dataset 7 months ago

meta-agents-research-environments/gaia2

Viewer • Updated Sep 25, 2025 • 963 • 12.2k • 40

liked 6 Spaces 7 months ago

Meta Agents Research Environments Demo

Explore Meta Agents research environments via web interface

Nano Banana PRO

Nano Banana for Hugging Face PRO users

PwCLeaderboardDisplay

Visualize ML model performance over time

Qwen Image

Generate detailed images from your text prompts

HFBA

A collection of Huggies!

41 LLMs Evaluated Locally on 19 Benchmarks

41 open-source LLMs benchmarked locally on 19 tasks.

liked a Space 8 months ago

Bringing paper to life: A modern template for scientific writing

Explore a scientific article with interactive visualizations

liked a dataset 8 months ago

THU-KEG/IFBench

Viewer • Updated Mar 7, 2025 • 444 • 133 • 10