Florent Daudens
fdaudens
AI & ML interests
AI & Journalism
Recent Activity
liked
a model
about 8 hours ago
all-hands/openhands-lm-32b-v0.1
posted
an
update
about 13 hours ago
Did we just drop personalized AI evaluation?! This tool auto-generates custom benchmarks on your docs to test which models are the best.
Most benchmarks test general capabilities, but what matters is how models handle your data and tasks. YourBench helps answer critical questions like:
- Do you really need a hundreds-of-billions-parameter model sledgehammer to crack a nut?
- Could a smaller, fine-tuned model work better?
- How well do different models understand your domain?
Some cool features:
📚 Generates custom benchmarks from your own documents (PDFs, Word, HTML)
🎯 Tests models on real tasks, not just general capabilities
🔄 Supports multiple models for different pipeline stages
🧠 Generate both single-hop and multi-hop questions
🔍 Evaluate top models and deploy leaderboards instantly
💰 Full cost analysis to optimize for your budget
🛠️ Fully configurable via a single YAML file
26 SOTA models tested for question generation. Interesting finding: Qwen2.5 32B leads in question diversity, while smaller Qwen models and Gemini 2.0 Flash offer great value for cost.
You can also run it locally on any models you want.
I'm impressed. Try it out: https://huggingface.co/spaces/yourbench/demo
liked
a Space
about 13 hours ago
yourbench/demo
Organizations
fdaudens's activity
Update README.md
1
#152 opened about 2 months ago
by
fdaudens

Best NLP tutorials?
1
#12 opened 4 months ago
by
ajwl
What stands out to you the most?
5
#4 opened 4 months ago
by
fdaudens

links from the predictions don't work for me
4
#1 opened 4 months ago
by
clem

open link in new tab
#3 opened 4 months ago
by
abhishek

Open links in another tab
2
#2 opened 4 months ago
by
nbroad

AI Agents for SQL queries
#10 opened 6 months ago
by
fdaudens

Hi! Introduce yourself! 👋
21
#2 opened 11 months ago
by
fdaudens

Best tools to demo journalists right now?
1
#8 opened 7 months ago
by
spencc

New AI bias detection tool for artificial images: Test skin tone and gender bias instantly
6
#7 opened 9 months ago
by
fdaudens

Update code.gs
2
#1 opened 10 months ago
by
louisbrulenaudet

Hugging Face in Sheets
#6 opened 10 months ago
by
fdaudens

Are there any issues that would need to be implemented with which I could help?
2
#3 opened 10 months ago
by
LeonIngelse
I getting this issue when use with API
3
#2 opened 10 months ago
by
colornative
replacing textbox with a JSON component
1
#1 opened 11 months ago
by
ysharma

No-Code Website Scraping
2
#5 opened 11 months ago
by
fdaudens

Librarian Bot: Add language metadata for dataset
#1 opened 11 months ago
by
librarian-bot

Algorithmic bias
1
#4 opened 11 months ago
by
fdaudens

New Whisper implementation optimized for speaker diarization
1
#3 opened 11 months ago
by
smach
