Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
qq8933Β 
posted an update Apr 22
Post
2045
ChemLLM datasets is all open source now!
ChemLLM: A Chemical Large Language Model (2402.06852)
700K of SFT Dataset, ChemData700K For Chemistry of LLM!
AI4Chem/ChemData700K
10K of DPO Dataset, ChemPref-10K, both English and Chinese!
AI4Chem/ChemPref-DPO-for-Chemistry-data-en
AI4Chem/ChemPref-DPO-for-Chemistry-data-cn
ChemBench-4K of 4100 high-quality single-choice benchmark for nine core Chemistry tasks!
AI4Chem/ChemBench4K
C-MHChem, 600 real test questions written and checked manually, from 25 years of Chinese National Middle school chemistry Test!
AI4Chem/C-MHChem-Benchmark-Chinese-Middle-high-school-Chemistry-Test
All hail to Open-source community!πŸ€—
In this post