A high quality Vietnamese pretraining dataset for LLMs
UET-IAI-NLP-ViEduQALLMs
community
AI & ML interests
None defined yet.
Recent Activity
Collections
1
models
0
None public yet
datasets
13
group2sealion/vnu_crawl
Viewer
•
Updated
•
42.2k
•
102
group2sealion/15mil_milestone
Viewer
•
Updated
•
2.43M
•
29
group2sealion/4mil_milestone
Viewer
•
Updated
•
2.53M
•
47
group2sealion/11mil_last
Viewer
•
Updated
•
1.85M
•
51
group2sealion/8mil_last
Viewer
•
Updated
•
1.85M
•
35
group2sealion/last_result
Viewer
•
Updated
•
1.82M
•
29
group2sealion/8mil_last_domains
Viewer
•
Updated
•
338k
•
28
group2sealion/8mil_clean
Viewer
•
Updated
•
1.73M
•
29
group2sealion/11mil_clean
Viewer
•
Updated
•
1.73M
•
25
group2sealion/11mil_milestone
Viewer
•
Updated
•
1.9M
•
38