LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning Paper • 2305.18169 • Published May 29, 2023 • 1
ParsiNLU: A Suite of Language Understanding Challenges for Persian Paper • 2012.06154 • Published Dec 11, 2020
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition Paper • 2306.02873 • Published Jun 5, 2023 • 1
Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages Paper • 2203.14139 • Published Mar 26, 2022 • 1
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers Paper • 2205.03286 • Published May 6, 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning Paper • 2211.05610 • Published Nov 10, 2022
Comparative Study of Multilingual Idioms and Similes in Large Language Models Paper • 2410.16461 • Published Oct 21, 2024
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation Paper • 2412.13375 • Published Dec 17, 2024
GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction Paper • 2505.10939 • Published May 16 • 1
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge Paper • 2506.14407 • Published Jun 17 • 2
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published Mar 6 • 4
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence Paper • 2503.05037 • Published Mar 6 • 4
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory Paper • 2404.11672 • Published Apr 17, 2024
Consistent Document-Level Relation Extraction via Counterfactuals Paper • 2407.06699 • Published Jul 9, 2024
MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment Paper • 2410.05873 • Published Oct 8, 2024 • 3
GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers Paper • 2205.03286 • Published May 6, 2022
Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations Paper • 2109.05958 • Published Sep 13, 2021