Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Spaces:
Allanatrix
/
Nexa_Data_Studio
Running

App Files Files Community
Fetching metadata from the HF Docker repository...
Nexa_Data_Studio / Tokenization
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
Allanatrix's picture
Allanatrix
Delete Tokenization/app.py
4dad6cd verified 6 days ago
  • Logs
    Upload 50 files 6 days ago
  • __pycache__
    Upload 50 files 6 days ago
  • app
    Upload 50 files 6 days ago
  • preprocessing
    Upload 50 files 6 days ago
  • pretraining
    Upload 50 files 6 days ago
  • Build_tokenizer.py
    3.48 kB
    Upload 50 files 6 days ago
  • Cleanser.py
    3.26 kB
    Upload 50 files 6 days ago
  • Entropy_ranker.py
    2.35 kB
    Upload 50 files 6 days ago
  • Label_tokens.py
    1.72 kB
    Upload 50 files 6 days ago
  • Main_2.py
    39.2 kB
    Upload 50 files 6 days ago
  • __init__.py
    603 Bytes
    Upload 50 files 6 days ago
  • combined_scientific_papers.json
    1.11 MB
    Upload 50 files 6 days ago
  • combined_scientific_papers.jsonl
    1.11 MB
    Upload 50 files 6 days ago
  • corpus_builder.log
    0 Bytes
    Upload 50 files 6 days ago
  • debug_upload.log
    27.9 kB
    Upload 50 files 6 days ago
  • generate_dataset.py
    3.49 kB
    Upload 50 files 6 days ago
  • hf_upload.py
    6.21 kB
    Upload 50 files 6 days ago
  • requirements.txt
    119 Bytes
    Upload 50 files 6 days ago
  • run_backend.py
    284 Bytes
    Upload 50 files 6 days ago