Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs Paper • 2407.15549 • Published Jul 22, 2024
view post Post 3286 Hello everyone,I am pleased to announce that I have founded the University of Glasgow organization on Huggingface. If you are affiliated with the University of Glasgow or have a relative who is, you can log in through the relevant link. UniversityofGlasgow 1 reply · 🚀 12 12 + Reply
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training Paper • 2309.17179 • Published Sep 29, 2023 • 2
Blind Justice: Fairness with Encrypted Sensitive Attributes Paper • 1806.03281 • Published Jun 8, 2018
Understanding accountability in algorithmic supply chains Paper • 2304.14749 • Published Apr 28, 2023
Algorithms that Remember: Model Inversion Attacks and Data Protection Law Paper • 1807.04644 • Published Jul 12, 2018
ChessGPT: Bridging Policy Learning and Language Modeling Paper • 2306.09200 • Published Jun 15, 2023 • 9