ModernBERT for Norwegian

#6
by hoxmark - opened

Have you thought about training a Norwegian ModernBERT (https://huggingface.co/answerdotai/ModernBERT-base) model?

That would be very useful.

Language Technology Group (University of Oslo) org

Yes, we are planning to release a collection of new NorBERTs that will be more optimized for inference speed :)

Great news! Thank you.

Will that also include the possible token length?

Language Technology Group (University of Oslo) org

What exactly do you mean by that? :)

Thank you for the answer, and my apologies, I somehow stopped mid-sentence.

My question was supposed to say:
In the blog post introducing ModernBERT, they mention that it supports sequence lengths of up to 8192 tokens. Is this something you will look into doing? :)
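
For context, the 8192-token window shows up directly in the released config. A quick sanity check, assuming a `transformers` version recent enough to include ModernBERT:

```python
# Read the configured context window of ModernBERT-base.
# Requires a transformers release with ModernBERT support.
from transformers import AutoConfig

config = AutoConfig.from_pretrained("answerdotai/ModernBERT-base")
print(config.max_position_embeddings)  # -> 8192
```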

Language Technology Group (University of Oslo) org

I see :) Yes, we will increase the sequence length. But note that even the current NorBERT3 can accept sequences longer than the 512 tokens it was trained on, thanks to its bucketed relative positional encoding.
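
For example, here is a minimal sketch of pushing NorBERT3 past 512 tokens. It follows the usage shown on the model card (the repo ships custom modeling code, hence `trust_remote_code=True`); output quality beyond the training length is of course not guaranteed:

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("ltg/norbert3-base")
model = AutoModelForMaskedLM.from_pretrained("ltg/norbert3-base", trust_remote_code=True)

# Roughly 1800 tokens, well past the 512-token training length.
long_text = " ".join(["Dette er en veldig lang tekst."] * 300)
inputs = tokenizer(long_text, return_tensors="pt")

# Bucketed relative positional encoding has no fixed position table,
# so the forward pass runs on the full sequence without truncation.
with torch.no_grad():
    outputs = model(**inputs)
print(inputs["input_ids"].shape, outputs.logits.shape)
```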

Language Technology Group (University of Oslo) org
edited Jun 10

Hi again, it took longer than we thought, but the NorBERT4 models are finally out! They were trained on a sequence length of 16384 tokens and offer the same efficiency as ModernBERT. Thanks for pushing us to do this :)

All sizes of the NorBERT4 family:
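
A minimal loading sketch for any of them; note that `ltg/norbert4-base` below is a placeholder repo id, so please check our organization page on the Hub for the actual names and sizes:

```python
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

# Placeholder id; replace with a released NorBERT4 checkpoint.
# trust_remote_code=True is harmless if the model is natively supported.
name = "ltg/norbert4-base"
tokenizer = AutoTokenizer.from_pretrained(name, trust_remote_code=True)
model = AutoModelForMaskedLM.from_pretrained(name, trust_remote_code=True)

# With a 16384-token training length, long documents fit in one pass.
inputs = tokenizer("En lang norsk tekst. " * 1000, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs.logits.shape)
```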

Wonderful! Thank you very much, I'll test them out in our product next week!
