AI & ML interests

Web as a corpus, Large Language Models, Machine Translation, Language Technologies, Natural Language Processing, Internet Archive, CommonCrawl

Recent Activity

bhavitvyamalik  updated a dataset 13 days ago
HPLT/DocHPLT
pinzhenchen  updated a dataset 16 days ago
HPLT/DocHPLT
View all activity

HPLT 's collections 9

Multilingual Translation Models
Translation models trained on OPUS data including HPLT datasets
Multilingual Translation Models
Translation models trained on OPUS data including HPLT datasets