Compressing Sentence Representation for Semantic Retrieval via Homomorphic Projective Distillation Paper • 2203.07687 • Published Mar 15, 2022
Protecting Language Generation Models via Invisible Watermarking Paper • 2302.03162 • Published Feb 6, 2023
Weak-to-Strong Jailbreaking on Large Language Models Paper • 2401.17256 • Published Jan 30, 2024 • 16