LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps
Paper
β’
2412.15035
β’
Published
β’
4
Open Source Language Models for Europe
scandeval
) to run your own benchmarks with. As part of project "Leesplank" (with Michiel Buisman and Maarten Lens-FitzGerald) we recently added GPT-4-1106-preview scores to add a good "target" to the leaderboard.load_dataset("BramVanroy/hplt_mono_v1_2", "nl_cleaned")