- Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models • Paper • arXiv:2505.16538 • Published 3 days ago • 2 upvotes
- Tracing Multilingual Factual Knowledge Acquisition in Pretraining • Paper • arXiv:2505.14824 • Published 4 days ago • 3 upvotes
- Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper • arXiv:2505.03335 • Published 19 days ago • 160 upvotes
- ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning • Paper • arXiv:2111.10952 • Published Nov 22, 2021 • 2 upvotes
- Eynollah models • Collection • Eynollah models for document image processing and layout analysis tasks • 14 items • Updated Mar 27 • 3 upvotes
- Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters • Paper • arXiv:2505.04393 • Published 18 days ago • 1 upvote
- Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs • Paper • arXiv:2505.04519 • Published 17 days ago • 2 upvotes
- ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations • Paper • arXiv:2505.02819 • Published 19 days ago • 24 upvotes
- Aleph-Alpha-GermanWeb: Improving German-language LLM pre-training with model-based data curation and synthetic data generation • Paper • arXiv:2505.00022 • Published about 1 month ago • 1 upvote
- Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language • Paper • arXiv:2504.19856 • Published 26 days ago • 1 upvote
- Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family • Paper • arXiv:2504.18225 • Published 30 days ago • 12 upvotes
- A Post-trainer's Guide to Multilingual Training Data: Uncovering Cross-lingual Transfer Dynamics • Paper • arXiv:2504.16677 • Published Apr 23 • 1 upvote
- ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance • Paper • arXiv:2504.08716 • Published Apr 11 • 10 upvotes
- OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens • Paper • arXiv:2504.07096 • Published Apr 9 • 73 upvotes
- Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation • Paper • arXiv:2504.06225 • Published Apr 8 • 1 upvote