Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenLLM-Ro 's Collections
RoLlama2
RoMistral
RoLlama3
RoLlama3.1
RoGemma
RoGemma2
Evaluation Datasets
Pretraining Datasets
SFT Datasets
Alignment Datasets

Pretraining Datasets

updated 18 days ago

This collection provides high-quality, large-scale Romanian pretraining datasets derived from FineWeb-2.

Upvote
-

  • OpenLLM-Ro/fineweb2-ro-llm

    Viewer • Updated 18 days ago • 1.06M • 202

  • OpenLLM-Ro/fineweb2-ro-bert

    Viewer • Updated 18 days ago • 54.1M • 818 • 1

  • OpenLLM-Ro/FineWeb2-RoEdu-Classifier

    Updated 18 days ago • 11
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs