Test model, trained with a variable masking ratio on instruction data in the format:
[CLS]User: Tell me ... [SEP] Assistant[MASK][MASK][MASK] you [MASK] problem [SEP]
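As a rough illustration of the format above, here is a minimal sketch of how one such training string could be built. It is not the actual training code: the helper name `mask_example`, the word-level masking (a real tokenizer would mask subword tokens), the uniform draw of the ratio, and the exact spacing and colon after "Assistant" are all assumptions.

```python
import random

MASK, CLS, SEP = "[MASK]", "[CLS]", "[SEP]"

def mask_example(user, assistant, ratio, rng):
    """Format one example and mask about `ratio` of the assistant words.

    Hypothetical helper: masks whole words, only in the assistant turn.
    """
    words = assistant.split()
    n_mask = round(ratio * len(words))
    masked_idx = set(rng.sample(range(len(words)), n_mask))
    masked = [MASK if i in masked_idx else w for i, w in enumerate(words)]
    return f"{CLS}User: {user} {SEP} Assistant: {' '.join(masked)} {SEP}"

rng = random.Random(0)
# Assumption: a fresh ratio is drawn uniformly from [0, 1] for every example.
ratio = rng.uniform(0.0, 1.0)
print(mask_example("Tell me a joke", "Sure here is one for you", ratio, rng))
```

Drawing a new ratio per example means the model sees everything from lightly corrupted to fully masked responses, which is what makes generation from an all-[MASK] response possible.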
See this notebook for sampling.
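For orientation only, one common way to sample from a masked model is iterative unmasking: fill in the most confident [MASK] positions, then re-predict the rest. The sketch below shows that loop with a toy stand-in for the model. The names `iterative_unmask` and `toy_predict` and the predictor interface are hypothetical, and the notebook's procedure may differ.

```python
from typing import Callable, List, Tuple

MASK = "[MASK]"

# Hypothetical predictor interface: for every masked position, return a
# (position, proposed_token, confidence) triple.
Predictor = Callable[[List[str]], List[Tuple[int, str, float]]]

def iterative_unmask(tokens: List[str], predict: Predictor, per_step: int = 1) -> List[str]:
    """Repeatedly commit the `per_step` most confident guesses until no mask is left."""
    tokens = list(tokens)
    # At most one round per initial mask, so the loop always terminates.
    for _ in range(tokens.count(MASK)):
        if MASK not in tokens:
            break
        guesses = sorted(predict(tokens), key=lambda g: g[2], reverse=True)
        for pos, tok, _conf in guesses[:per_step]:
            tokens[pos] = tok
    return tokens

def toy_predict(tokens: List[str]) -> List[Tuple[int, str, float]]:
    """Toy stand-in for a real model: always proposes words from a fixed sentence."""
    target = ["I", "can", "help", "you"]
    return [(i, target[i], 1.0 / (i + 1)) for i, t in enumerate(tokens) if t == MASK]

print(iterative_unmask([MASK, "can", MASK, "you"], toy_predict))
```

With a real model, `predict` would run a forward pass and read the top token and its probability at each masked position; sampling from the distribution instead of taking the top token gives more varied outputs.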
Use at your own risk. Based on answerdotai/ModernBERT-large :)