The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations Paper • 2507.13302 • Published 19 days ago • 4
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations Paper • 2507.13302 • Published 19 days ago • 4
The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations Paper • 2507.13302 • Published 19 days ago • 4
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9 • 9
It's the same but not the same: Do LLMs distinguish Spanish varieties? Paper • 2504.20049 • Published Apr 8
Spanish and LLM Benchmarks: is MMLU Lost in Translation? Paper • 2406.17789 • Published May 28, 2024 • 2
How Stable is Stable Diffusion under Recursive InPainting (RIP)? Paper • 2407.09549 • Published Jun 27, 2024 • 1
Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal Paper • 2408.16012 • Published Aug 16, 2024
Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail? Paper • 2409.15334 • Published Sep 8, 2024 • 1
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published Jan 16 • 34
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published Jan 16 • 34
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper • 2501.09775 • Published Jan 16 • 34
Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail? Paper • 2409.15334 • Published Sep 8, 2024 • 1
How Stable is Stable Diffusion under Recursive InPainting (RIP)? Paper • 2407.09549 • Published Jun 27, 2024 • 1
The #Somos600M Project: Generating NLP resources that represent the diversity of the languages from LATAM, the Caribbean, and Spain Paper • 2407.17479 • Published Jul 1, 2024 • 1
Spanish and LLM Benchmarks: is MMLU Lost in Translation? Paper • 2406.17789 • Published May 28, 2024 • 2