Xuankun Rong's picture

Xuankun Rong

XuankunRong

·

https://xuankunrong.github.io/

XuankunRong

AI & ML interests

AI Safety

Recent Activity

new activity 1 day ago

XuankunRong/SafeTag-VL-3K:Add task category, paper, and code links to dataset card

upvoted a paper 16 days ago

Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening

upvoted a paper 22 days ago

THINKSAFE: Self-Generated Safety Alignment for Reasoning Models

View all activity

Organizations

None yet

New activity in XuankunRong/SafeTag-VL-3K 1 day ago

Add task category, paper, and code links to dataset card

#2 opened 3 months ago by

commented a paper 3 months ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published Nov 17, 2025 • 4 •

commented a paper 9 months ago

Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Paper • 2505.16916 • Published May 22, 2025 • 17 •