Quentin Lhoest PRO

lhoestq

AI & ML interests

Maintainer of 🤗Datasets: NLP, Multimodal data processing and sharing

Articles

Organizations

lhoestq's activity

upvoted an article 1 day ago
view article
Article

Introducing the SQL Console on Datasets

9
upvoted an article 22 days ago
view article
Article

Scaling robotics datasets with video encoding

31
upvoted an article 23 days ago
view article
Article

Deep Learning over the Internet: Training Language Models Collaboratively

4
upvoted 2 articles about 1 month ago
view article
Article

⭐ PySpark and 🤗 Hugging Face Parquet Files

By asoria
5
view article
Article

XetHub is joining Hugging Face!

76
upvoted an article about 2 months ago
view article
Article

WWDC 24: Running Mistral 7B with Core ML

54
upvoted 7 articles 2 months ago
view article
Article

Docmatix - a huge dataset for Document Visual Question Answering

63
view article
Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

17
view article
Article

Enhancing Search Capabilities for Non-English Datasets in the Dataset Viewer

By asoria
4
view article
Article

Experimenting with Automatic PII Detection on the Hub using Presidio

23
view article
Article

Announcing New Dataset Search Features

22
upvoted an article 3 months ago
upvoted 3 articles 3 months ago
view article
Article

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

12
view article
Article

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted an article 4 months ago
upvoted an article 5 months ago
view article
Article

Synthetic data: save money, time and carbon with open source

45
upvoted an article 5 months ago
view article
Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

23