Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
GermanT5
/
occiglot5
like
1
Follow
GermanT5
10
TensorBoard
Safetensors
occiglot/occiglot-fineweb-v1.0
HuggingFaceFW/fineweb
HuggingFaceFW/fineweb-edu
English
German
t5
arxiv:
1910.10683
arxiv:
2205.05131
arxiv:
2109.10686
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
occiglot5
Ctrl+K
Ctrl+K
1 contributor
History:
17 commits
stefan-it
readme: add more clarifications about German FineWeb dataset, used for pretraining
8f19a39
verified
3 months ago
.gitattributes
Safe
1.57 kB
figures: add initial version of logo
3 months ago
README.md
Safe
1.06 kB
readme: add more clarifications about German FineWeb dataset, used for pretraining
3 months ago
config.json
Safe
811 Bytes
model: add initial config
3 months ago
early-access.png
Safe
12 kB
figures: add early access logo
3 months ago
events.out.tfevents.1734425863.t1v-n-7c8a43e7-w-3.313043.0.v2
Safe
10 MB
LFS
metrics: add original TensorBoard logs
3 months ago
model-00001-of-00002.safetensors
Safe
5 GB
LFS
model: add initial version (part 1 of 2)
3 months ago
model-00002-of-00002.safetensors
Safe
702 MB
LFS
model: add initial version (part 2 of 2)
3 months ago
model.safetensors.index.json
Safe
75.8 kB
model: add initial version (index.json)
3 months ago
occiglot5_logo.png
Safe
1.37 MB
LFS
figures: add initial version of logo
3 months ago
spiece.model
Safe
771 kB
LFS
tokenizer: add SPM model
3 months ago
spiece.vocab
Safe
562 kB
tokenizer: add SPM vocab
3 months ago
tokenizer_config.json
Safe
1.92 kB
tokenizer: add config
3 months ago