Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kenhktsui 's Collections
LongTalk
FastText Model for Pretraining Data Curation
textbook-quality-classifier
CoT
nano-phi
VLM Data

FastText Model for Pretraining Data Curation

updated 5 days ago
Upvote
3

  • kenhktsui/llm-data-textbook-quality-fasttext-classifier-v2

    Text Classification • Updated Nov 28, 2024 • 1.34k • 27

  • kenhktsui/fineweb-edu-fasttext-classifier

    Text Classification • Updated Jun 6, 2024 • 7 • 4

  • kenhktsui/code-natural-language-fasttext-classifier

    Text Classification • Updated Oct 30, 2024 • 4.96k • 1

  • kenhktsui/math-fasttext-classifier

    Text Classification • Updated Feb 26 • 9 • 1

  • kenhktsui/finefineweb-domain-fasttext-classifier

    Text Classification • Updated Mar 16 • 2 • 1
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs