Model Specification

  • Model: RoBERTa Tagalog Base (Jan Christian Blaise Cruz)
  • Training Data:
    • Manxโ€“Cadhan corpora (Top 4 Language)
  • Training Details:
    • Base configurations

Evaluation

  • Evaluation Dataset: Universal Dependencies Tagalog Ugnayan (Testing Set)
  • Tested in a zero-shot cross-lingual scenario on a Universal Dependencies Tagalog Ugnayan testing dataset (with 42.04% Accuracy)

POS Tags

  • ADJ โ€“ ADP โ€“ ADV โ€“ CCONJ โ€“ DET โ€“ INTJ โ€“ NOUN โ€“ NUM โ€“ PART โ€“ PRON โ€“ PROPN โ€“ PUNCT โ€“ SCONJ โ€“ VERB
Downloads last month
2
Safetensors
Model size
109M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Dataset used to train iceman2434/roberta-tagalog-base-ft-udpos213-gv

Collection including iceman2434/roberta-tagalog-base-ft-udpos213-gv