Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Paper โข 2505.24523 โข Published 9 days ago โข 8
๐ Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized โข 116 items โข Updated 2 days ago โข 105