wiki
lucadiliello/wikipedia_512_pretraining • Updated Mar 24, 2023 • 6.9M • 54 • 14
vocab-transformers/wiki-en-passages-20210101 • Updated Feb 24, 2022 • 10.2M • 9 • 1
community-datasets/wiki_snippets • Updated Jun 26, 2024 • 51.4M • 887 • 5
pszemraj/simple_wikipedia • Updated Sep 9, 2023 • 238k • 389 • 7
Instruction Finetuning Datasets
mlabonne/orpo-dpo-mix-40k • Updated Oct 17, 2024 • 44.2k • 684 • 288
LDJnr/Capybara • Updated Jun 7, 2024 • 16k • 276 • 242
teknium/OpenHermes-2.5 • Updated Apr 15, 2024 • 1M • 2.12k • 746
Claude
Data-Agora/general_claude3.5_sonnet_10000 • Updated Sep 25, 2024 • 10k • 7 • 6
SicariusSicariiStuff/Claude_32K • Updated Oct 13, 2024 • 30.5k • 23 • 3
seungone/final-math-claude3.5_sonnet-10000 • Updated Sep 16, 2024 • 10k • 5 • 1
anthracite-org/nopm_claude_writing_fixed • Updated Aug 18, 2024 • 6.35k • 39 • 14