AI & ML interests
Research Demos and Tools for Trustworthy and Safe AI Development and Deployment
Recent Activity
View all activity
Organization Card
- Welcome to TrustSafeAI! We are a reseach group focusing on evaluating and improving AI safety.
- If you are interested in joining us, please reach out to Pin-Yu Chen
- Team Members and Projects:
Member | Project | Webpage |
---|---|---|
Xiaomeng Hu | RADAR (NeurIPS'23), Gradient Cuff (NeurIPS'24) | webpage |
Lei Hsiung | NeuralFuse (NeurIPS'24), NCTV (TMLR; IJCAI'23) | webpage |
Zhi-Yi Chin | P4D (ICML'24) | webpage |
Barry Xiong | DPP | - |
Johnson Hung | AttentionTracker | - |
Zaitang Li | GREAT Score (NeurIPS'24) | - |
Yung-Chen Tang | NCTV (TMLR; IJCAI'23) , LLM-Physical-Safety | webpage |
Zhiyuan He | BEYOND (ICML'24) | - |
Yujun Zhou | LLM LabSafety | - |
Xiangyu Qi | LLM Finetuning Safety (ICLR'24) | webpage |
Pin-Yu Chen | All (research supervisor) | webpage |
spaces
9
Running
3
Attention Tracker Prompt Injection Detector
⚡
Attention Tracker: Prompt Injection Detector
Running
Retention Score
🧠
Evaluate jailbreak risks for Vision-Language Models
Running
LLM Physical Safety
👀
LLM benchmark for Physical Safety
Running
NeuralFuse
⚡
Protect Model from Suffering Low-voltage-induced Bit Errors
Running
3
NCTV: Neural Clamping Toolkit and Visualization
🦀
Model-agnostic Toolkit for Neural Network Calibration
Running
GREAT Score
🧠
Evaluate adversarial robustness using generative models