---
datasets:
- togethercomputer/RedPajama-Data-V2
language:
- de
library_name: transformers
license: other
tags:
- fill-mask
- masked-lm
- long-context
- modernbert
pipeline_tag: feature-extraction
---

# ModernGBERT 134M

This is a German ModernBERT language model with 134M parameters, trained from scratch with the ModernBERT [codebase](https://github.com/AnswerDotAI/ModernBERT) on the same German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2) that was used for our [LLäMmlein](https://huggingface.co/collections/LSX-UniWue/llammlein-6732ff41f3705c686e605762) model family.

Find more details in our [preprint](https://arxiv.org/abs/2505.13136)!

### Usage

```python
from transformers import AutoModel, AutoTokenizer

# Load the tokenizer and model weights from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/ModernGBERT_134M")
model = AutoModel.from_pretrained("LSX-UniWue/ModernGBERT_134M")
```
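The model returns one hidden state per token; to reduce these to a single fixed-size text embedding (as suggested by the `feature-extraction` pipeline tag), a common recipe is attention-masked mean pooling. Below is a minimal sketch: the `mean_pool` helper and the dummy tensors are our illustration, not part of the released model. With real outputs you would pass `model(**inputs).last_hidden_state` together with `inputs["attention_mask"]`.

```python
import torch

def mean_pool(last_hidden_state, attention_mask):
    # Zero out padding positions, then average over real tokens only.
    mask = attention_mask.unsqueeze(-1).float()    # (batch, seq, 1)
    summed = (last_hidden_state * mask).sum(dim=1) # (batch, hidden)
    counts = mask.sum(dim=1).clamp(min=1e-9)       # avoid division by zero
    return summed / counts

# Dummy tensors standing in for a model output:
# batch=2, seq_len=4, hidden=8; second sequence has one padding token.
hidden = torch.randn(2, 4, 8)
attn = torch.tensor([[1, 1, 1, 1],
                     [1, 1, 1, 0]])
emb = mean_pool(hidden, attn)
print(emb.shape)  # torch.Size([2, 8])
```

Masked pooling matters whenever sequences in a batch are padded: a plain `mean(dim=1)` would dilute shorter texts with padding vectors.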

### Performance
|
We evaluated our model on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark. |