<!doctype html>
<html>
<head>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width" />
<title>My static Space</title>
<link rel="stylesheet" href="style.css" />
</head>
<body>
<div>
<h1>Below is a list of 258 French datasets that are poorly referenced on the Hub:</h1>
</div>
<br><br>
<div style="column-count: 2; column-gap: 40px;">
<a href="https://huggingface.co/datasets/adiren7/darija_to_french_speech_to_text">adiren7/darija_to_french_speech_to_text</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/ASN_Lettres_De_Suivi_filtered">AdrienB134/ASN_Lettres_De_Suivi_filtered</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/ASN_pairs">AdrienB134/ASN_pairs</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/easyfinetune_Instruct_test">AdrienB134/easyfinetune_Instruct_test</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/easyfinetune_QA_test">AdrienB134/easyfinetune_QA_test</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/Emilia-dataset-french-split">AdrienB134/Emilia-dataset-french-split</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/french-tts-mul">AdrienB134/french-tts-mul</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/french-unique-speaker-tts">AdrienB134/french-unique-speaker-tts</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/Instruct_ASN_medium">AdrienB134/Instruct_ASN_medium</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/Instruct_ASN_small">AdrienB134/Instruct_ASN_small</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/QA_ASN_small">AdrienB134/QA_ASN_small</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/QA_ASN_test">AdrienB134/QA_ASN_test</a><br>
<a href="https://huggingface.co/datasets/AdrienB134/Small-markdown">AdrienB134/Small-markdown</a><br>
<a href="https://huggingface.co/datasets/Adjoumani/translations_french_baoule_V1">Adjoumani/translations_french_baoule_V1</a><br>
<a href="https://huggingface.co/datasets/adlbh/rekrute-2005-2022">adlbh/rekrute-2005-2022</a><br>
<a href="https://huggingface.co/datasets/adwaitagashe/bordIRlines">adwaitagashe/bordIRlines</a><br>
<a href="https://huggingface.co/datasets/ahmadSiddiqi/x-stance_fr">ahmadSiddiqi/x-stance_fr</a><br>
<a href="https://huggingface.co/datasets/ahazeemi/iwslt14-en-fr">ahazeemi/iwslt14-en-fr</a><br>
<a href="https://huggingface.co/datasets/ai4bharat/intel">ai4bharat/intel</a><br>
<a href="https://huggingface.co/datasets/ai4bharat/recon">ai4bharat/recon</a><br>
<a href="https://huggingface.co/datasets/allenai/WildChat-1M">allenai/WildChat-1M</a><br>
<a href="https://huggingface.co/datasets/almanach/LADaS">almanach/LADaS</a><br>
<a href="https://huggingface.co/datasets/alvations/body-parts">alvations/body-parts</a><br>
<a href="https://huggingface.co/datasets/alvations/c4p0-v1-en-fr">alvations/c4p0-v1-en-fr</a><br>
<a href="https://huggingface.co/datasets/alvations/c4p0-v1-fr-en">alvations/c4p0-v1-fr-en</a><br>
<a href="https://huggingface.co/datasets/alvations/c4p0-v2-en-fr">alvations/c4p0-v2-en-fr</a><br>
<a href="https://huggingface.co/datasets/alvations/c4p0-v2-fr-en">alvations/c4p0-v2-fr-en</a><br>
<a href="https://huggingface.co/datasets/alvations/dslml24-jelly-submission-fr">alvations/dslml24-jelly-submission-fr</a><br>
<a href="https://huggingface.co/datasets/alvations/food-and-beverage">alvations/food-and-beverage</a><br>
<a href="https://huggingface.co/datasets/alvations/units">alvations/units</a><br>
<a href="https://huggingface.co/datasets/alvations/xnli-15way">alvations/xnli-15way</a><br>
<a href="https://huggingface.co/datasets/Alwaly/fr_voxpopuli">Alwaly/fr_voxpopuli</a><br>
<a href="https://huggingface.co/datasets/Alwaly/french-Wolof-lang-classification">Alwaly/french-Wolof-lang-classification</a><br>
<a href="https://huggingface.co/datasets/Alwaly/frenchToWolof">Alwaly/frenchToWolof</a><br>
<a href="https://huggingface.co/datasets/Alwaly/frenchToWolof_">Alwaly/frenchToWolof_</a><br>
<a href="https://huggingface.co/datasets/Alwaly/multilingual-wolof-french-asr">Alwaly/multilingual-wolof-french-asr</a><br>
<a href="https://huggingface.co/datasets/Alwaly/multilingual-wolof-french-en">Alwaly/multilingual-wolof-french-en</a><br>
<a href="https://huggingface.co/datasets/AmazonScience/mintaka">AmazonScience/mintaka</a><br>
<a href="https://huggingface.co/datasets/arbml/UFAL">arbml/UFAL</a><br>
<a href="https://huggingface.co/datasets/astha/languagemodelsforRNNdecomposition">astha/languagemodelsforRNNdecomposition</a><br>
<a href="https://huggingface.co/datasets/babs/unlabelled-french-voxpopuli">babs/unlabelled-french-voxpopuli</a><br>
<a href="https://huggingface.co/datasets/beethogedeon/fr_fon">beethogedeon/fr_fon</a><br>
<a href="https://huggingface.co/datasets/BitTranslate/chatgpt-prompts-French">BitTranslate/chatgpt-prompts-French</a><br>
<a href="https://huggingface.co/datasets/bio-datasets/e3c">bio-datasets/e3c</a><br>
<a href="https://huggingface.co/datasets/bosbos/french_english_instruct">bosbos/french_english_instruct</a><br>
<a href="https://huggingface.co/datasets/Brendan/nlp244_french_snli">Brendan/nlp244_french_snli</a><br>
<a href="https://huggingface.co/datasets/chocobearz/BERSt">chocobearz/BERSt</a><br>
<a href="https://huggingface.co/datasets/cjvt/janes_preklop">cjvt/janes_preklop</a><br>
<a href="https://huggingface.co/datasets/CohereForAI/m-ArenaHard">CohereForAI/m-ArenaHard</a><br>
<a href="https://huggingface.co/datasets/coref-data/corefud_raw">coref-data/corefud_raw</a><br>
<a href="https://huggingface.co/datasets/corto-ai/open-australian-legal-multi-lingual-qa">corto-ai/open-australian-legal-multi-lingual-qa</a><br>
<a href="https://huggingface.co/datasets/Databasesprojec/FinStmts_ConsUncons_French_Predict_part1">Databasesprojec/FinStmts_ConsUncons_French_Predict_part1</a><br>
<a href="https://huggingface.co/datasets/Databasesprojec/FinStmts_ConsUncons_French_Predict_part2">Databasesprojec/FinStmts_ConsUncons_French_Predict_part2</a><br>
<a href="https://huggingface.co/datasets/Databasesprojec/FinStmts_ConsUncons_French_SeqClass">Databasesprojec/FinStmts_ConsUncons_French_SeqClass</a><br>
<a href="https://huggingface.co/datasets/Databasesprojec/FinStmts_ConsUncons_Reduced_UndersampleMajority_French_SeqClass">Databasesprojec/FinStmts_ConsUncons_Reduced_UndersampleMajority_French_SeqClass</a><br>
<a href="https://huggingface.co/datasets/Databoost/TTS_Multilingual_Data">Databoost/TTS_Multilingual_Data</a><br>
<a href="https://huggingface.co/datasets/dataset-rewriter/SmallTalkDialogues-10-translated-to-proper-French-466b">dataset-rewriter/SmallTalkDialogues-10-translated-to-proper-French-466b</a><br>
<a href="https://huggingface.co/datasets/dataset-rewriter/SmallTalkDialogues-translated-to-proper-French-466b">dataset-rewriter/SmallTalkDialogues-translated-to-proper-French-466b</a><br>
<a href="https://huggingface.co/datasets/ekazuki/french_deputies_tweet">ekazuki/french_deputies_tweet</a><br>
<a href="https://huggingface.co/datasets/ekazuki/french_deputies_tweet_old">ekazuki/french_deputies_tweet_old</a><br>
<a href="https://huggingface.co/datasets/ekazuki/french_deputies_tweet_sentiment">ekazuki/french_deputies_tweet_sentiment</a><br>
<a href="https://huggingface.co/datasets/ekazuki/text_to_french_parliament_group">ekazuki/text_to_french_parliament_group</a><br>
<a href="https://huggingface.co/datasets/ekazuki/text_to_french_parliament_group_beta">ekazuki/text_to_french_parliament_group_beta</a><br>
<a href="https://huggingface.co/datasets/ekazuki/text_to_french_parliament_group_debates">ekazuki/text_to_french_parliament_group_debates</a><br>
<a href="https://huggingface.co/datasets/ekazuki/text_to_french_parliament_group_written_questions">ekazuki/text_to_french_parliament_group_written_questions</a><br>
<a href="https://huggingface.co/datasets/EssalhiSara/french.corpus">EssalhiSara/french.corpus</a><br>
<a href="https://huggingface.co/datasets/EssalhiSara/French_corpus">EssalhiSara/French_corpus</a><br>
<a href="https://huggingface.co/datasets/Farah21/frenchOrientation">Farah21/frenchOrientation</a><br>
<a href="https://huggingface.co/datasets/fdaudens/aya_dataset_french_example">fdaudens/aya_dataset_french_example</a><br>
<a href="https://huggingface.co/datasets/fdaudens/aya_french_dpo">fdaudens/aya_french_dpo</a><br>
<a href="https://huggingface.co/datasets/ferrazzipietro/e3c">ferrazzipietro/e3c</a><br>
<a href="https://huggingface.co/datasets/FreedomIntelligence/ApolloCorpus">FreedomIntelligence/ApolloCorpus</a><br>
<a href="https://huggingface.co/datasets/FreedomIntelligence/alpaca-gpt4-french">FreedomIntelligence/alpaca-gpt4-french</a><br>
<a href="https://huggingface.co/datasets/FreedomIntelligence/MMLU_French">FreedomIntelligence/MMLU_French</a><br>
<a href="https://huggingface.co/datasets/FreedomIntelligence/sharegpt-french">FreedomIntelligence/sharegpt-french</a><br>
<a href="https://huggingface.co/datasets/freds0/cml_tts_dataset_french">freds0/cml_tts_dataset_french</a><br>
<a href="https://huggingface.co/datasets/gasp/french_rap_songs">gasp/french_rap_songs</a><br>
<a href="https://huggingface.co/datasets/Geraldine/bso-publications-indexation-50k">Geraldine/bso-publications-indexation-50k</a><br>
<a href="https://huggingface.co/datasets/gmnlp/tico19">gmnlp/tico19</a><br>
<a href="https://huggingface.co/datasets/GregoryD/explicit-function-calling-french">GregoryD/explicit-function-calling-french</a><br>
<a href="https://huggingface.co/datasets/gustawdaniel/ngram-google-2012">gustawdaniel/ngram-google-2012</a><br>
<a href="https://huggingface.co/datasets/Hazzzardous/synthetic-translations-6k-unvalidated">Hazzzardous/synthetic-translations-6k-unvalidated</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_100k">hcoxec/french_100k</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_danish">hcoxec/french_danish</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_danish_mix">hcoxec/french_danish_mix</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_finnish">hcoxec/french_finnish</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_finnish_mix">hcoxec/french_finnish_mix</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_german">hcoxec/french_german</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_german_mix">hcoxec/french_german_mix</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_romanian">hcoxec/french_romanian</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_romanian_mix">hcoxec/french_romanian_mix</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_spanish">hcoxec/french_spanish</a><br>
<a href="https://huggingface.co/datasets/hcoxec/french_spanish_mix">hcoxec/french_spanish_mix</a><br>
<a href="https://huggingface.co/datasets/imvladikon/paranames">imvladikon/paranames</a><br>
<a href="https://huggingface.co/datasets/infinite-dataset-hub/NoGunFranceText">infinite-dataset-hub/NoGunFranceText</a><br>
<a href="https://huggingface.co/datasets/infinite-dataset-hub/Pedale-FrenchTextCorpus">infinite-dataset-hub/Pedale-FrenchTextCorpus</a><br>
<a href="https://huggingface.co/datasets/Intuit-GenSRF/all_french_datasets">Intuit-GenSRF/all_french_datasets</a><br>
<a href="https://huggingface.co/datasets/iot/eng_to_french">iot/eng_to_french</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/French_English_2">ismailiismail/French_English_2</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/FrEn_handpicks">ismailiismail/FrEn_handpicks</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/ner">ismailiismail/ner</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/multi_paraphrasing_french">ismailiismail/multi_paraphrasing_french</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/paragraphss_paraphrasing">ismailiismail/paragraphss_paraphrasing</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/paraphrasing_french">ismailiismail/paraphrasing_french</a><br>
<a href="https://huggingface.co/datasets/ismailiismail/paraphrasing_french_5000">ismailiismail/paraphrasing_french_5000</a><br>
<a href="https://huggingface.co/datasets/iix/Parquet_FIles">iix/Parquet_FIles</a><br>
<a href="https://huggingface.co/datasets/Jasgui11/French">Jasgui11/French</a><br>
<a href="https://huggingface.co/datasets/JohnnyEudora/Translation">JohnnyEudora/Translation</a><br>
<a href="https://huggingface.co/datasets/juletxara/pawsx_mt">juletxara/pawsx_mt</a><br>
<a href="https://huggingface.co/datasets/juletxara/mgsm_mt">juletxara/mgsm_mt</a><br>
<a href="https://huggingface.co/datasets/juletxara/xnli_mt">juletxara/xnli_mt</a><br>
<a href="https://huggingface.co/datasets/jwang214/arc_french">jwang214/arc_french</a><br>
<a href="https://huggingface.co/datasets/jzhang86/fr_ifeval">jzhang86/fr_ifeval</a><br>
<a href="https://huggingface.co/datasets/jzhang86/frmmlu_no_train">jzhang86/frmmlu_no_train</a><br>
<a href="https://huggingface.co/datasets/kaitchup/opus-English-to-French">kaitchup/opus-English-to-French</a><br>
<a href="https://huggingface.co/datasets/kaitchup/opus-French-to-English">kaitchup/opus-French-to-English</a><br>
<a href="https://huggingface.co/datasets/kloodia/alpaca_french">kloodia/alpaca_french</a><br>
<a href="https://huggingface.co/datasets/lidiapierre/fr_sexism_labelled">lidiapierre/fr_sexism_labelled</a><br>
<a href="https://huggingface.co/datasets/lincoln/newsquadfr">lincoln/newsquadfr</a><br>
<a href="https://huggingface.co/datasets/lightblue/mitsu">lightblue/mitsu</a><br>
<a href="https://huggingface.co/datasets/llama-lang-adapt/wura">llama-lang-adapt/wura</a><br>
<a href="https://huggingface.co/datasets/LsTam/CQUAE_documents">LsTam/CQUAE_documents</a><br>
<a href="https://huggingface.co/datasets/LsTam/generated_user_questions_samplemd">LsTam/generated_user_questions_samplemd</a><br>
<a href="https://huggingface.co/datasets/LsTam/opus_instruction_format">LsTam/opus_instruction_format</a><br>
<a href="https://huggingface.co/datasets/LsTam/raw_samples_md">LsTam/raw_samples_md</a><br>
<a href="https://huggingface.co/datasets/lyon-nlp/mteb-fr-reranking-syntec-s2p">lyon-nlp/mteb-fr-reranking-syntec-s2p</a><br>
<a href="https://huggingface.co/datasets/m-biriuchinskii/ICDAR2017-filtered-1800-1900-3">m-biriuchinskii/ICDAR2017-filtered-1800-1900-3</a><br>
<a href="https://huggingface.co/datasets/m-biriuchinskii/ICDAR2017-filtered-1800-1900-4">m-biriuchinskii/ICDAR2017-filtered-1800-1900-4</a><br>
<a href="https://huggingface.co/datasets/m-biriuchinskii/ICDAR2017-filtered-1800-1900-5">m-biriuchinskii/ICDAR2017-filtered-1800-1900-5</a><br>
<a href="https://huggingface.co/datasets/Makxxx/wikinews">Makxxx/wikinews</a><br>
<a href="https://huggingface.co/datasets/malteos/french_CEFR">malteos/french_CEFR</a><br>
<a href="https://huggingface.co/datasets/manu/croissant_french_dataset">manu/croissant_french_dataset</a><br>
<a href="https://huggingface.co/datasets/manu/dataset_en_fr">manu/dataset_en_fr</a><br>
<a href="https://huggingface.co/datasets/manu/dataset_en_fr_short">manu/dataset_en_fr_short</a><br>
<a href="https://huggingface.co/datasets/manu/dila_legifrance">manu/dila_legifrance</a><br>
<a href="https://huggingface.co/datasets/manu/europarl-en-fr">manu/europarl-en-fr</a><br>
<a href="https://huggingface.co/datasets/manu/fr_corpora_parliament_processed-lowercased">manu/fr_corpora_parliament_processed-lowercased</a><br>
<a href="https://huggingface.co/datasets/manu/french-30b">manu/french-30b</a><br>
<a href="https://huggingface.co/datasets/manu/french-30b_separate">manu/french-30b_separate</a><br>
<a href="https://huggingface.co/datasets/manu/french-bench-grammar-vocab-reading">manu/french-bench-grammar-vocab-reading</a><br>
<a href="https://huggingface.co/datasets/manu/french_5p">manu/french_5p</a><br>
<a href="https://huggingface.co/datasets/manu/french_5p_separate">manu/french_5p_separate</a><br>
<a href="https://huggingface.co/datasets/manu/french_bench_arc_challenge">manu/french_bench_arc_challenge</a><br>
<a href="https://huggingface.co/datasets/manu/french_bench_hellaswag">manu/french_bench_hellaswag</a><br>
<a href="https://huggingface.co/datasets/manu/french_boolq">manu/french_boolq</a><br>
<a href="https://huggingface.co/datasets/manu/french_librispeech_text_only">manu/french_librispeech_text_only</a><br>
<a href="https://huggingface.co/datasets/manu/french_poetry">manu/french_poetry</a><br>
<a href="https://huggingface.co/datasets/manu/old_french_30b_separate">manu/old_french_30b_separate</a><br>
<a href="https://huggingface.co/datasets/manu/opus100-en-fr">manu/opus100-en-fr</a><br>
<a href="https://huggingface.co/datasets/manu/theses_fr_2013_2023">manu/theses_fr_2013_2023</a><br>
<a href="https://huggingface.co/datasets/manu/tok-corpus-shuffled">manu/tok-corpus-shuffled</a><br>
<a href="https://huggingface.co/datasets/manu/wikisource_fr">manu/wikisource_fr</a><br>
<a href="https://huggingface.co/datasets/manu/wmt-en-fr">manu/wmt-en-fr</a><br>
<a href="https://huggingface.co/datasets/mattlc/french_multicorpus_tft_v040">mattlc/french_multicorpus_tft_v040</a><br>
<a href="https://huggingface.co/datasets/MBZUAI/ALM-Bench">MBZUAI/ALM-Bench</a><br>
<a href="https://huggingface.co/datasets/MBZUAI/MINT_BAK">MBZUAI/MINT_BAK</a><br>
<a href="https://huggingface.co/datasets/MBZUAI/multilingual-llava-bench-in-the-wild">MBZUAI/multilingual-llava-bench-in-the-wild</a><br>
<a href="https://huggingface.co/datasets/MBZUAI/palo_multilingual_dataset">MBZUAI/palo_multilingual_dataset</a><br>
<a href="https://huggingface.co/datasets/MBZUAI-Paris/Darija-SFT-Mixture">MBZUAI-Paris/Darija-SFT-Mixture</a><br>
<a href="https://huggingface.co/datasets/md-nishat-008/Mojo_Corpus">md-nishat-008/Mojo_Corpus</a><br>
<a href="https://huggingface.co/datasets/Mediform/sharegpt-french">Mediform/sharegpt-french</a><br>
<a href="https://huggingface.co/datasets/mgb-dx-meetup/product-reviews">mgb-dx-meetup/product-reviews</a><br>
<a href="https://huggingface.co/datasets/Michielo/Merged-LID-20">Michielo/Merged-LID-20</a><br>
<a href="https://huggingface.co/datasets/MilaNLProc/honest">MilaNLProc/honest</a><br>
<a href="https://huggingface.co/datasets/misclassified/meps_speeches">misclassified/meps_speeches</a><br>
<a href="https://huggingface.co/datasets/musts/french">musts/french</a><br>
<a href="https://huggingface.co/datasets/nedjmaou/MLMA_hate_speech">nedjmaou/MLMA_hate_speech</a><br>
<a href="https://huggingface.co/datasets/nguyenthanhasia/MSD_multilingual">nguyenthanhasia/MSD_multilingual</a><br>
<a href="https://huggingface.co/datasets/nielsr/datacomp_small_french_captions">nielsr/datacomp_small_french_captions</a><br>
<a href="https://huggingface.co/datasets/nirantk/french-books">nirantk/french-books</a><br>
<a href="https://huggingface.co/datasets/nickcpk/handcrafted_en_fr_data">nickcpk/handcrafted_en_fr_data</a><br>
<a href="https://huggingface.co/datasets/odunola/french-audio-preprocessed">odunola/french-audio-preprocessed</a><br>
<a href="https://huggingface.co/datasets/odunola/french-english-preprocessed">odunola/french-english-preprocessed</a><br>
<a href="https://huggingface.co/datasets/odunola/french-english-unprocessed">odunola/french-english-unprocessed</a><br>
<a href="https://huggingface.co/datasets/odunola/french-preprocessed-2">odunola/french-preprocessed-2</a><br>
<a href="https://huggingface.co/datasets/odunola/french-preprocessed-test">odunola/french-preprocessed-test</a><br>
<a href="https://huggingface.co/datasets/opsci/Astree">opsci/Astree</a><br>
<a href="https://huggingface.co/datasets/paulml/chatml-OpenHermes2.5-dpo-binarized-alpha-french">paulml/chatml-OpenHermes2.5-dpo-binarized-alpha-french</a><br>
<a href="https://huggingface.co/datasets/Panoramax/fr_road_sign_subsign">Panoramax/fr_road_sign_subsign</a><br>
<a href="https://huggingface.co/datasets/PHBJT/cml-tts">PHBJT/cml-tts</a><br>
<a href="https://huggingface.co/datasets/PHBJT/cml-tts-20percent-subset">PHBJT/cml-tts-20percent-subset</a><br>
<a href="https://huggingface.co/datasets/PHBJT/cml-tts-20percent-subset-description">PHBJT/cml-tts-20percent-subset-description</a><br>
<a href="https://huggingface.co/datasets/PITTI/MicRou">PITTI/MicRou</a><br>
<a href="https://huggingface.co/datasets/PITTI/MicRou_chunked">PITTI/MicRou_chunked</a><br>
<a href="https://huggingface.co/datasets/PleIAs/AMF-PDF">PleIAs/AMF-PDF</a><br>
<a href="https://huggingface.co/datasets/PleIAs/AMF-Text">PleIAs/AMF-Text</a><br>
<a href="https://huggingface.co/datasets/PleIAs/common_corpus">PleIAs/common_corpus</a><br>
<a href="https://huggingface.co/datasets/PleIAs/FrenchCompariaCategorised">PleIAs/FrenchCompariaCategorised</a><br>
<a href="https://huggingface.co/datasets/PleIAs/GATT_library">PleIAs/GATT_library</a><br>
<a href="https://huggingface.co/datasets/PleIAs/KaribuAI">PleIAs/KaribuAI</a><br>
<a href="https://huggingface.co/datasets/PleIAs/Multilingual-PD">PleIAs/Multilingual-PD</a><br>
<a href="https://huggingface.co/datasets/PleIAs/Pleias-1.0-eval">PleIAs/Pleias-1.0-eval</a><br>
<a href="https://huggingface.co/datasets/PleIAs/RAG-Evals">PleIAs/RAG-Evals</a><br>
<a href="https://huggingface.co/datasets/PleIAs/TEDEUTenders">PleIAs/TEDEUTenders</a><br>
<a href="https://huggingface.co/datasets/PleIAs/WTO-PDF">PleIAs/WTO-PDF</a><br>
<a href="https://huggingface.co/datasets/PleIAs/WTO-Text">PleIAs/WTO-Text</a><br>
<a href="https://huggingface.co/datasets/Poulpidot/FrenchHateSpeechSuperset">Poulpidot/FrenchHateSpeechSuperset</a><br>
<a href="https://huggingface.co/datasets/ProfessorBob/E5-finetune-dataset">ProfessorBob/E5-finetune-dataset</a><br>
<a href="https://huggingface.co/datasets/ProfessorBob/instruct-MultiQ3">ProfessorBob/instruct-MultiQ3</a><br>
<a href="https://huggingface.co/datasets/ProfessorBob/keyword_extraction">ProfessorBob/keyword_extraction</a><br>
<a href="https://huggingface.co/datasets/ProfessorBob/Long_context_chunking">ProfessorBob/Long_context_chunking</a><br>
<a href="https://huggingface.co/datasets/ProfessorBob/text-embedding-dataset">ProfessorBob/text-embedding-dataset</a><br>
<a href="https://huggingface.co/datasets/Punchwe/ted_talk_multi_parallel">Punchwe/ted_talk_multi_parallel</a><br>
<a href="https://huggingface.co/datasets/pvisnrt/french-snli">pvisnrt/french-snli</a><br>
<a href="https://huggingface.co/datasets/qanastek/ECDC">qanastek/ECDC</a><br>
<a href="https://huggingface.co/datasets/RaiBP/openwebtext2-first-30-chunks-lang-detect-raw-output">RaiBP/openwebtext2-first-30-chunks-lang-detect-raw-output</a><br>
<a href="https://huggingface.co/datasets/rasaboun/french">rasaboun/french</a><br>
<a href="https://huggingface.co/datasets/rcds/MultiLegalNeg">rcds/MultiLegalNeg</a><br>
<a href="https://huggingface.co/datasets/rcds/slds">rcds/slds</a><br>
<a href="https://huggingface.co/datasets/rish16/MLe-SNLI">rish16/MLe-SNLI</a><br>
<a href="https://huggingface.co/datasets/Sabrina1763/wikipedia_french">Sabrina1763/wikipedia_french</a><br>
<a href="https://huggingface.co/datasets/sakthivinash/Language_Detection">sakthivinash/Language_Detection</a><br>
<a href="https://huggingface.co/datasets/sagot/lefff_morpho">sagot/lefff_morpho</a><br>
<a href="https://huggingface.co/datasets/SEACrowd/paracotta_id">SEACrowd/paracotta_id</a><br>
<a href="https://huggingface.co/datasets/SergeiZu/french-film-reviews">SergeiZu/french-film-reviews</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MedExpQA-Benchmark">shuyuej/French-MedExpQA-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-Anatomy-Benchmark">shuyuej/French-MMLU-Anatomy-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-Clinical-Knowledge-Benchmark">shuyuej/French-MMLU-Clinical-Knowledge-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-College-Biology-Benchmark">shuyuej/French-MMLU-College-Biology-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-College-Medicine-Benchmark">shuyuej/French-MMLU-College-Medicine-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-Medical-Genetics-Benchmark">shuyuej/French-MMLU-Medical-Genetics-Benchmark</a><br>
<a href="https://huggingface.co/datasets/shuyuej/French-MMLU-Professional-Medicine-Benchmark">shuyuej/French-MMLU-Professional-Medicine-Benchmark</a><br>
<a href="https://huggingface.co/datasets/startlingadama/bambara-french">startlingadama/bambara-french</a><br>
<a href="https://huggingface.co/datasets/StephanAkkerman/frequency-words-2018">StephanAkkerman/frequency-words-2018</a><br>
<a href="https://huggingface.co/datasets/stefan-it/autotrain-flair-hipe2022-de-hmbert">stefan-it/autotrain-flair-hipe2022-de-hmbert</a><br>
<a href="https://huggingface.co/datasets/sugam11/french-snli">sugam11/french-snli</a><br>
<a href="https://huggingface.co/datasets/tamedai/oscar_eu_6x3M">tamedai/oscar_eu_6x3M</a><br>
<a href="https://huggingface.co/datasets/tbboukhari/Alpaca-in-french">tbboukhari/Alpaca-in-french</a><br>
<a href="https://huggingface.co/datasets/the-french-artist/hatvp_declarations_text_index_embeds">the-french-artist/hatvp_declarations_text_index_embeds</a><br>
<a href="https://huggingface.co/datasets/Tngarg/french_eng">Tngarg/french_eng</a><br>
<a href="https://huggingface.co/datasets/Tngarg/french_english">Tngarg/french_english</a><br>
<a href="https://huggingface.co/datasets/Tngarg/French_of">Tngarg/French_of</a><br>
<a href="https://huggingface.co/datasets/Tngarg/french_train">Tngarg/french_train</a><br>
<a href="https://huggingface.co/datasets/TrainingDataPro/amazon-reviews-dataset">TrainingDataPro/amazon-reviews-dataset</a><br>
<a href="https://huggingface.co/datasets/UdyanSachdev/Multi_Language_Audio2Text">UdyanSachdev/Multi_Language_Audio2Text</a><br>
<a href="https://huggingface.co/datasets/unicamp-dl/mmarco">unicamp-dl/mmarco</a><br>
<a href="https://huggingface.co/datasets/unicamp-dl/mrobust">unicamp-dl/mrobust</a><br>
<a href="https://huggingface.co/datasets/uvci/koumankan4dyula">uvci/koumankan4dyula</a><br>
<a href="https://huggingface.co/datasets/vekkt/french_CEFR">vekkt/french_CEFR</a><br>
<a href="https://huggingface.co/datasets/Vivian12300/mathqa_test_French_by_llama-8B-instruct">Vivian12300/mathqa_test_French_by_llama-8B-instruct</a><br>
<a href="https://huggingface.co/datasets/WhissleAI/multilingual-libri-test-french">WhissleAI/multilingual-libri-test-french</a><br>
<a href="https://huggingface.co/datasets/wraps/everyday-conversations-llama3.1-2k-french">wraps/everyday-conversations-llama3.1-2k-french</a><br>
<a href="https://huggingface.co/datasets/yzhuang/arc_challenge_test_French_by_Meta-Llama-3-8B-Instruct">yzhuang/arc_challenge_test_French_by_Meta-Llama-3-8B-Instruct</a><br>
<a href="https://huggingface.co/datasets/yzhuang/mathqa_test_French_by_Meta-Llama-3-8B-Instruct">yzhuang/mathqa_test_French_by_Meta-Llama-3-8B-Instruct</a><br>
<a href="https://huggingface.co/datasets/yzhuang/mmlu_test_French_by_Meta-Llama-3-8B-Instruct">yzhuang/mmlu_test_French_by_Meta-Llama-3-8B-Instruct</a><br>
<a href="https://huggingface.co/datasets/yezhengli9/wmt20-de-fr">yezhengli9/wmt20-de-fr</a><br>
<a href="https://huggingface.co/datasets/yezhengli9/wmt20-fr-de">yezhengli9/wmt20-fr-de</a><br>
<a href="https://huggingface.co/datasets/wasertech/TrainingSpeech">wasertech/TrainingSpeech</a><br>
<a href="https://huggingface.co/datasets/zelros/insurance-fr">zelros/insurance-fr</a><br>
<a href="https://huggingface.co/datasets/zelros/pj">zelros/pj</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-axa">zelros/pj-axa</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-ca">zelros/pj-ca</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-ce">zelros/pj-ce</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-da">zelros/pj-da</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-groupama">zelros/pj-groupama</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-maif">zelros/pj-maif</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-lbp">zelros/pj-lbp</a><br>
<a href="https://huggingface.co/datasets/zelros/pj-sg">zelros/pj-sg</a><br>
</div>
</body>
</html>