A collection of processed CommonCrawl data, part of the BigBanyanTree initiative. Each dataset is extracted from a random 1% sample of the CommonCrawl data.
- big-banyan-tree/BBT_CommonCrawl_2018 • 61.5M • 186 • 3
- big-banyan-tree/BBT_CommonCrawl_2019 • 55.8M • 112 • 2
- big-banyan-tree/BBT_CommonCrawl_2020 • 46.9M • 87 • 2
- big-banyan-tree/BBT_CommonCrawl_2021 • 48.5M • 556 • 2