The folks at Foursquare released a dataset of 104.5 million places of interest (foursquare/fsq-os-places), and here are all of them on a plot.
The Lichess database of games, puzzles, and engine evaluations is now on the Hub: Lichess. Billions of chess data points to download, query, and stream, and we're excited to see what you'll build with it! ♟️ 🤗
- Lichess/positions-datasets-66f50837db5cd3287d60d489
- https://huggingface.co/collections/Lichess/games-datasets-66f508df78f4b43e1bb2d353
StarCoder 2 and The Stack v2: The Next Generation Paper • 2402.19173 • Published Feb 29, 2024 • 147
Data map of the languages of https://huggingface.co/datasets/CohereForAI/aya_dataset
TIL: EleutherAI/pile is on Wikipedia: https://en.wikipedia.org/wiki/The_Pile_(dataset)