BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 33
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub By jsulz and 3 others • Feb 12 • 72
File names and splits Collection 8 datasets showcase the diversity of split configurations on Hugging Face. See docs: https://huggingface.co/docs/hub/datasets-file-names-and-splits. • 8 items • Updated Nov 22, 2023 • 9
Audio dataset Collection 11 datasets showcase how to configure and load audio datasets • 11 items • Updated Aug 2, 2024 • 2
Image dataset Collection 10 datasets showcase how to configure and load image datasets • 10 items • Updated Aug 2, 2024 • 7
Format: CSV and TSV Collection 6 datasets showcase how to configure and load CSV and TSV files. • 6 items • Updated Nov 23, 2023 • 6
Manual Configuration Collection 5 datasets showcase YAML configuration on Hugging Face. See docs: https://huggingface.co/docs/hub/datasets-manual-configuration. • 5 items • Updated Nov 23, 2023 • 6
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 417