Predicting the Type and Target of Offensive Posts in Social Media Paper • 1902.09666 • Published Feb 25, 2019
Beyond English-Only Reading Comprehension: Experiments in Zero-Shot Multilingual Transfer for Bulgarian Paper • 1908.01519 • Published Aug 5, 2019
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification Paper • 2004.14454 • Published Apr 29, 2020
On the Effect of Dropping Layers of Pre-trained Transformer Models Paper • 2004.03844 • Published Apr 8, 2020 • 1
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) Paper • 2006.07235 • Published Jun 12, 2020
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection Paper • 2305.14902 • Published May 24, 2023 • 1
We Can Detect Your Bias: Predicting the Political Ideology of News Articles Paper • 2010.05338 • Published Oct 11, 2020
Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output Paper • 2311.09000 • Published Nov 15, 2023 • 1
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering Paper • 2011.03080 • Published Nov 5, 2020
bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark Paper • 2306.02349 • Published Jun 4, 2023
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text Paper • 2306.05540 • Published May 23, 2023
AraStance: A Multi-Country and Multi-Domain Dataset of Arabic Stance Detection for Fact Checking Paper • 2104.13559 • Published Apr 28, 2021
RuleBert: Teaching Soft Rules to Pre-trained Language Models Paper • 2109.13006 • Published Sep 24, 2021
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs Paper • 2308.13387 • Published Aug 25, 2023 • 1
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic Paper • 2402.12840 • Published Feb 20, 2024 • 1
EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models Paper • 2403.10378 • Published Mar 15, 2024
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification Paper • 2403.04696 • Published Mar 7, 2024 • 4
Semantic Ranking for Automated Adversarial Technique Annotation in Security Text Paper • 2403.17068 • Published Mar 25, 2024
Can a Multichoice Dataset be Repurposed for Extractive Question Answering? Paper • 2404.17342 • Published Apr 26, 2024 • 1
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs Paper • 2405.05583 • Published May 9, 2024