ChemPile Collection The ChemPile is a dataset with over 77 billion curated multimodal tokens about chemistry. For more information, visit https://chempile.lamalab.org/. • 8 items • Updated May 5 • 19
AI scientists produce results without reasoning scientifically Paper • 2604.18805 • Published Apr 20 • 7
ChemBench-Collection Collection Datasets, Spaces and Results related to ChemBench • 4 items • Updated Oct 3, 2025 • 4
ChemPile: A 250GB Diverse and Curated Dataset for Chemical Foundation Models Paper • 2505.12534 • Published May 18, 2025 • 3
Article A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard +1 ofermend, minseokbae, clefourrier • Jan 12, 2024 • 8