SmolLM3 pretraining datasets Collection datasets used in SmolLM3 pretraining • 14 items • Updated about 13 hours ago • 8
SmolLM3 evaluation datasets Collection Datasets to decontaminate the post-training mixtures against. Use the subset and column values described per entry • 13 items • Updated about 13 hours ago • 4