Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xuankun Rong's picture
3 18 2

Xuankun Rong

XuankunRong
renjiepi's profile picture winzheng's profile picture WilliamHuang91's profile picture
·
https://xuankunrong.github.io/
  • XuankunRong

AI & ML interests

AI Safety

Recent Activity

new activity 1 day ago
XuankunRong/SafeTag-VL-3K:Add task category, paper, and code links to dataset card
upvoted a paper 16 days ago
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening
upvoted a paper 22 days ago
THINKSAFE: Self-Generated Safety Alignment for Reasoning Models
View all activity

Organizations

None yet

New activity in XuankunRong/SafeTag-VL-3K 1 day ago

Add task category, paper, and code links to dataset card

#2 opened 3 months ago by
nielsr
commented a paper 3 months ago

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Paper • 2511.12982 • Published Nov 17, 2025 • 4 •
2
commented a paper 9 months ago

Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Paper • 2505.16916 • Published May 22, 2025 • 17 •
2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs