ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance Paper • 2504.08716 • Published Apr 11 • 10
CamemBERT 2.0: A Smarter French Language Model Aged to Perfection Paper • 2411.08868 • Published Nov 13, 2024 • 13
Harvesting Textual and Structured Data from the HAL Publication Repository Paper • 2407.20595 • Published Jul 30, 2024 • 22
AraGPT2: Pre-Trained Transformer for Arabic Language Generation Paper • 2012.15520 • Published Dec 31, 2020
AraELECTRA: Pre-Training Text Discriminators for Arabic Language Understanding Paper • 2012.15516 • Published Dec 31, 2020 • 2
AraBERT: Transformer-based Model for Arabic Language Understanding Paper • 2003.00104 • Published Feb 28, 2020 • 3