Clémentine Fourrier

clefourrier

AI & ML interests

None yet

Organizations

Hugging Face, Long Range Graph Benchmark, Evaluation datasets, BigScience: LMs for Historical Texts, HuggingFaceBR4, Huggingface Projects, Open Graph Benchmark, HuggingFaceGECLM, Pretrained Graph Transformers, Graph Datasets, BigCode, Hugging Face H4, InternLM, Vectara, GAIA, Hugging Face Smol Cluster, plfe, Open LLM Leaderboard, Qwen, Secure Learning Lab, Open Life Science AI, LLM360, TTS Eval (OLD), hallucinations-leaderboard, Bias Leaderboard Development, Leaderboard Organization, Demo Leaderboard, Demo leaderboard with an integrated backend, gg-hf, AIM-Harvard, Clinical & Biomedical ML Leaderboards, Women on Hugging Face, LMLLO2, Lighthouz AI, Open Arabic LLM Leaderboard, mx-test, IBM Granite, FineData, HF-contamination-detection, TTS AGI, Leader Board Test Org, Social Post Explorers, hsramall, Open RL Leaderboard, The Fin AI, La Leaderboard, Open Hebrew LLM's Leaderboard, gg-tt, HuggingFaceEval, HP Inc., Novel Challenge, Open LLM Leaderboard Archive, LLHF, SLLHF, lbhf, nltpt, Lighteval testing org, CléMax, Hugging Face Science, test_org, Coordination Nationale pour l'IA, LeMaterial, open-llm-leaderboard-react, Prompt Leaderboard, UBC-NLP Collaborations, smolagents, Your Bench, leaderboard explorer, Open R1, SIMS, OpenEvals, GeekAgents, piupiu.xyz, PoseidonDemo, interview 4298, LightEval Internal Testing

Posts 18

Post
452
Always surprised that so few people actually read the FineTasks blog, on
✨how to select training evals with the highest signal✨

If you're serious about training models without wasting compute on shitty runs, you absolutely should read it!!

A high-signal eval actually tells you precisely, during training, how well your model is learning & what it is learning, allowing you to discard the bad runs/bad samplings/...!

The blog covers prompt choice, metrics, and datasets in depth, across languages/capabilities, and my fave section is "which properties should evals have"👌
(so you know how to select the best evals for your use case)

Blog: HuggingFaceFW/blogpost-fine-tasks

Articles 34

Article
29

Fixing Open LLM Leaderboard with Math-Verify

Datasets 0

None public yet