Datasets
updated
shayekh/perplexity__aya_dataset__train
Updated • 10
Viewer
• Updated • 540k • 29
• 1
argilla/magpie-ultra-v0.1
Viewer
• Updated • 50k • 689
• 221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1
Viewer
• Updated • 1M • 118
• 14
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 36.6k
• 444
Viewer
• Updated • 100k • 7.3k
• 265
BanglaLLM/bangla-alpaca-orca
Viewer
• Updated • 172k • 47
• 4
AhmadMustafa/Urdu-Instruct-News-Article-Generation
Viewer
• Updated • 112k • 25
• 4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation
Viewer
• Updated • 112k • 12
AhmadMustafa/Urdu-Instruct-News-Category-Classification
Viewer
• Updated • 112k • 27
Viewer
• Updated • 10k • 298
• 54
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft
Viewer
• Updated • 6.37M • 58
• 2
CohereLabs/aya_collection_language_split
Viewer
• Updated • 514M • 3.75k
• 114
Viewer
• Updated • 63k • 175
• 35
Viewer
• Updated • 21.9M • 1.83k
• 700
convaiinnovations/Nadi_Indic466k_Instruct
Viewer
• Updated • 466k • 7
• 2
ai4bharat/indic-instruct-data-v0.1
Viewer
• Updated • 404k • 276
• 25
Viewer
• Updated • 9.97k • 29
• 2
MarkrAI/KoCommercial-Dataset
Viewer
• Updated • 175k • 520
• 165