Why is the "add_bos_token" set to True in tokenizer_config.json?

#17
by heya5 - opened

Should I keep the bos_token_id in my input_ids for downstream tasks?

Hey @heyaa,

OPT uses a GPT2Tokenizer but prepends every prompt with a BOS TOKEN (e.g. <s> Hello there instead of Hello there)

Sign up or log in to comment