CommonCanvas Collection Collection of models trained on the CommonCatalogue datasets โข 8 items โข Updated May 16, 2024 โข 10
ElanMT Collection Japanese English Machine Translation trained on openly licensed corpus โข 5 items โข Updated Nov 29, 2024 โข 3
OpenCulture Collection A multilingual dataset of public domain books and newspapers. โข 27 items โข Updated Nov 6, 2024 โข 123
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images Paper โข 2310.16825 โข Published Oct 25, 2023 โข 33
Tiny Series Collection Tiny datasets that empower the foundation of Small Language Model! โข 11 items โข Updated Jan 26, 2024 โข 36