ChemPile Collection The ChemPile is a dataset with over 75 billion curated multimodal tokens about chemistry. For more information, visit https://chempile.lamalab.org/. • 9 items • Updated 5 days ago • 13
MatText: Do Language Models Need More than Text & Scale for Materials Modeling? Paper • 2406.17295 • Published Jun 25, 2024 • 1
Probing the limitations of multimodal language models for chemistry and materials research Paper • 2411.16955 • Published Nov 25, 2024 • 1
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 33