---
license: bsd
language:
- sk
base_model:
- FacebookAI/roberta-base
---
# Slovak Roberta Base
A monolingual Slovak language model.
The model was trained on a collection of Slovak web pages from various sources.
## Training parameters
Training ran on 4 x A100 40GB GPUs for 14 hours.
- Effective batch size: 192
- Sequence length: 512
- Training steps: 120,000
- warmup_steps: 1000
- Optimizer: AdamW
- Per-device batch size: 48
- mixed_precision: bf16
- Weight decay: 0.01
- Gradient clipping: 1.0
- learning_rate: 1e-5
- Scheduler: cosine
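
The figures above are related by simple arithmetic, which the following sketch works through. It is an illustration only, not the training script: the constant names and the helper `learning_rate_at` are hypothetical, and it assumes no gradient accumulation, a linear warmup, and a cosine decay that reaches zero at the final step. None of these assumptions is stated in this card.

```python
import math

# Values taken from the training parameters listed above.
NUM_GPUS = 4
PER_DEVICE_BATCH_SIZE = 48
SEQUENCE_LENGTH = 512
TRAINING_STEPS = 120_000
WARMUP_STEPS = 1_000
PEAK_LEARNING_RATE = 1e-5

# Assumption: no gradient accumulation (factor of 1).
GRADIENT_ACCUMULATION_STEPS = 1

# 4 GPUs x 48 sequences per device matches the stated effective batch size.
effective_batch_size = NUM_GPUS * PER_DEVICE_BATCH_SIZE * GRADIENT_ACCUMULATION_STEPS

# Upper bounds: every sequence is assumed to be filled to the full length.
max_tokens_per_step = effective_batch_size * SEQUENCE_LENGTH
max_tokens_total = max_tokens_per_step * TRAINING_STEPS


def learning_rate_at(step: int) -> float:
    """Hypothetical helper: linear warmup, then cosine decay to zero."""
    if step < WARMUP_STEPS:
        return PEAK_LEARNING_RATE * step / WARMUP_STEPS
    progress = (step - WARMUP_STEPS) / (TRAINING_STEPS - WARMUP_STEPS)
    return PEAK_LEARNING_RATE * 0.5 * (1.0 + math.cos(math.pi * progress))


print(effective_batch_size)  # 192
print(max_tokens_per_step)   # 98304
print(max_tokens_total)      # 11796480000
```

Under these assumptions the run processes at most about 11.8 billion tokens, and the learning rate peaks at 1e-5 once warmup ends at step 1000.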