OpenEuroLLM

community

https://openeurollm.eu/

openeurollm

AI & ML interests

Open, Multilingual, European, Generative, Foundational LLM

Recent Activity

Zihao-Li

authored 6 papers 9 days ago

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

Paper • 2409.17892 • Published Sep 26, 2024 • 2

GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models

Paper • 2504.04155 • Published Apr 5 • 1

Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources

Paper • 2504.04152 • Published Apr 5 • 1

Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data

Paper • 2506.00469 • Published May 31 • 2

A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives

Paper • 2407.15489 • Published Jul 22, 2024

Scaling Low-Resource MT via Synthetic Data Generation with LLMs

Paper • 2505.14423 • Published May 20

Villekom

updated a model 11 days ago

openeurollm/eu4t_6040

Updated 11 days ago • 198

timurcarstensen

authored a paper about 2 months ago

DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products

Paper • 2502.10297 • Published Feb 14

Villekom

authored 2 papers 2 months ago

Got Compute, but No Data: Lessons From Post-training a Finnish LLM

Paper • 2503.09407 • Published Mar 12 • 1

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

geoalgo

authored a paper 3 months ago

ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning

Paper • 2409.18827 • Published Sep 27, 2024

tiedeman

authored a paper 3 months ago

Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data

Paper • 2506.00469 • Published May 31 • 2

tiedeman

authored a paper 6 months ago

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

mbanon

authored a paper 6 months ago

An Expanded Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2503.10267 • Published Mar 13 • 1

flxst

authored 2 papers 7 months ago

GPT-SW3: An Autoregressive Language Model for the Nordic Languages

Paper • 2305.12987 • Published May 22, 2023

Better Embeddings with Coupled Adam

Paper • 2502.08441 • Published Feb 12 • 1

mbanon

authored 2 papers 8 months ago

A New Massive Multilingual Dataset for High-Performance Language Technologies

Paper • 2403.14009 • Published Mar 20, 2024 • 1

FastSpell: the LangId Magic Spell

Paper • 2404.08345 • Published Apr 12, 2024

tiedeman

authored 2 papers 11 months ago

Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging

Paper • 2304.04726 • Published Apr 10, 2023

Sentence Embeddings in NLI with Iterative Refinement Encoders

Paper • 1808.08762 • Published Aug 27, 2018

Team members: 19