Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Hanbaike
/
kyrgyz_spm_tokenizer

Kyrgyz
kyrgyz
tokenization
sentencepiece
BPE
Unigram
Model card Files Files and versions Community
kyrgyz_spm_tokenizer
Ctrl+K
Ctrl+K
  • 1 contributor
History: 9 commits
Hanbaike's picture
Hanbaike
Update README.md
d7f8f25 verified 17 days ago
  • models
    Upload folder using huggingface_hub 17 days ago
  • text
    Upload folder using huggingface_hub 17 days ago
  • .gitattributes
    1.97 kB
    Upload folder using huggingface_hub 17 days ago
  • .gitignore
    0 Bytes
    Upload folder using huggingface_hub 17 days ago
  • .gitignoreer
    0 Bytes
    Upload folder using huggingface_hub 17 days ago
  • README.md
    6.78 kB
    Update README.md 17 days ago
  • clean.py
    3.58 kB
    Upload folder using huggingface_hub 17 days ago
  • efficiency.py
    1.82 kB
    Upload folder using huggingface_hub 17 days ago
  • graph.jpg
    64 kB
    Upload folder using huggingface_hub 17 days ago
  • kyrgyz_clean_sentences.txt
    300 MB
    LFS
    Upload folder using huggingface_hub 17 days ago
  • readme.md
    6.88 kB
    Upload folder using huggingface_hub 17 days ago
  • sample.py
    707 Bytes
    Upload folder using huggingface_hub 17 days ago
  • special_tokens_map.json
    114 Bytes
    Upload folder using huggingface_hub 17 days ago
  • symbols.py
    1.15 kB
    Upload folder using huggingface_hub 17 days ago
  • tokenization.py
    1.04 kB
    Upload folder using huggingface_hub 17 days ago
  • tokenizer_config.json
    227 Bytes
    Upload folder using huggingface_hub 17 days ago
  • upload_models.py
    5.09 kB
    Upload folder using huggingface_hub 17 days ago
  • user_defined_symbols.txt
    71 Bytes
    Upload folder using huggingface_hub 17 days ago