readme: add more clarifications about German FineWeb dataset, used for pretraining 8f19a39 verified stefan-it commited on Mar 29