Hi, the repository README mentions:

By default, input text longer than 128 word pieces is truncated.

However, the max_seq_length attribute of the sentence_transformers model returns 512:

from sentence_transformers import SentenceTransformer
model_st = SentenceTransformer('all-mpnet-base-v1')
print(model_st.max_seq_length)
# 512

The same value is returned by the Hugging Face transformers approach:

from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained('sentence-transformers/all-mpnet-base-v1')
print(tokenizer.model_max_length)
# 512

Shouldn't the README be updated from 128 to 512?

I have the same question! Looking to embed text up to the maximum sequence length of 512. I am assuming it won't be truncated at 128 despite what the README says?
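
In case it helps anyone checking this on their own text, here is a minimal sketch of how one might test whether a given input would be truncated under the limit the loaded model actually reports. The helper name truncation_report and the demo function are hypothetical (not part of either library); the sketch assumes the model exposes max_seq_length and tokenizer, as SentenceTransformer objects do, and it only counts word pieces rather than saying anything about embedding quality at longer lengths.

```python
def truncation_report(model, text):
    """Return (token_count, limit, truncated) for `text`.

    `model` is assumed to expose `max_seq_length` and a callable
    `tokenizer`, as a SentenceTransformer instance does.
    """
    limit = model.max_seq_length
    # Tokenize without truncation to count word pieces (special tokens included).
    token_count = len(model.tokenizer(text, truncation=False)["input_ids"])
    return token_count, limit, token_count > limit


def demo():
    """Hypothetical usage; downloads the model on first call."""
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-mpnet-base-v1")
    long_text = "word " * 300
    print(truncation_report(model, long_text))

    # The limit can also be set explicitly if a different cutoff is wanted.
    model.max_seq_length = 128
    print(truncation_report(model, long_text))
```

Calling demo() prints the token count, the limit in effect, and whether the text exceeds that limit, first with the value the model loads with and then with the limit set to 128.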

That's a great observation, thank you for posting this.

