hindi-foundational-model-base / tokenizer_config.json
convaiinnovations's picture
Initial upload of custom Hindi LM v1
b253808 verified
raw
history blame
190 Bytes
{
"tokenizer_class": "SentencePieceTokenizerWrapper",
"vocab_size": 16000,
"bos_token_id": 1,
"eos_token_id": 2,
"pad_token_id": 0,
"unk_token_id": 3,
"model_max_length": 512
}