Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library, NER & PoS Tagging, LM Pretraining (mostly encoder-only), Historical Language Models
Recent Activity
upvoted
an
article
2 days ago
FineWeb2-C: Help Build Better Language Models in Your Language
upvoted
a
paper
3 days ago
GEITje 7B Ultra: A Conversational Model for Dutch
upvoted
a
paper
3 days ago
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language
Models
Articles
Organizations
stefan-it's activity
upvoted
an
article
2 days ago
Article
FineWeb2-C: Help Build Better Language Models in Your Language
By
•
•
10jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images
Paper
•
2412.08802
•
Published
•
4
Evaluating Pixel Language Models on Non-Standardized Languages
Paper
•
2412.09084
•
Published
•
1
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Paper
•
2412.09341
•
Published
•
1
OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages
Paper
•
2412.09587
•
Published
•
3
The Impact of Copyrighted Material on Large Language Models: A Norwegian Perspective
Paper
•
2412.09460
•
Published
•
5
upvoted
an
article
21 days ago
Article
They Said It Couldn’t Be Done
By
•
•
75upvoted
a
paper
about 1 month ago
upvoted
a
collection
about 1 month ago
upvoted
a
paper
about 1 month ago
Toxicity of the Commons: Curating Open-Source Pre-Training Data
Paper
•
2410.22587
•
Published
•
10
Representation Deficiency in Masked Language Modeling
Paper
•
2302.02060
•
Published
•
1
GPT or BERT: why not both?
Paper
•
2410.24159
•
Published
•
14
Zipfian Whitening
Paper
•
2411.00680
•
Published
•
9
WikiNER-fr-gold: A Gold-Standard NER Corpus
Paper
•
2411.00030
•
Published
•
4
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
Paper
•
2410.20771
•
Published
•
3
upvoted
a
paper
2 months ago