mmBERT is trained on 3T tokens from over 1,800 languages and shows state-of-the-art benchmark scores and exceptional low-resource performance.
Papers
Genomic Next-Token Predictors are In-Context Learners
Controlled Generation for Private Synthetic Text
Models (53)
jhu-clsp/mmBERT-small • Fill-Mask • 21.4k • 64
jhu-clsp/mmBERT-base • Fill-Mask • 270k • 193
jhu-clsp/mmBERT-checkpoints • 4
jhu-clsp/ettin-decoder-1b • Fill-Mask • 182 • 5
jhu-clsp/ettin-decoder-32m • Text Generation • 264
jhu-clsp/ettin-encoder-1b • Feature Extraction • 734 • 21
jhu-clsp/ettin-encoder-68m • Fill-Mask • 9.35k • 3
jhu-clsp/ettin-dec-from-enc-32m • Text Generation • 4
jhu-clsp/ettin-encoder-150m • Fill-Mask • 21.4k • 9
jhu-clsp/ettin-decoder-400m • Text Generation • 145 • 4
Datasets (38)
jhu-clsp/megawika-2 • 144 • 2
jhu-clsp/mmBERT-decay-data • 12.4k • 5
jhu-clsp/mmBERT-midtraining-data • 16.4k • 1
jhu-clsp/ettin-pretraining-data • 5.65k • 8
jhu-clsp/ettin-decay-data • 1.8k • 1
jhu-clsp/astro-llms-benchmark-dataset • 40 • 110
jhu-clsp/astro-llms-full-query-data • 368 • 99
jhu-clsp/ettin-extension-data • 1.62k
jhu-clsp/ettin-data-order • 3B • 5 • 1
jhu-clsp/rank1-R1-MSMARCO • 635k • 43 • 2