Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
thu-ml
's Collections
STAIR
STAIR
updated
Feb 26
Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)
Upvote
1
thu-ml/STAIR-Llama-3.1-8B-SFT
Text Generation
•
Updated
Feb 25
•
18
thu-ml/STAIR-Qwen2-7B-SFT
Text Generation
•
Updated
Feb 25
•
21
•
1
thu-ml/STAIR-SFT
Viewer
•
Updated
Feb 25
•
20k
•
125
thu-ml/STAIR-Prompts
Viewer
•
Updated
Feb 25
•
63k
•
67
STAIR: Improving Safety Alignment with Introspective Reasoning
Paper
•
2502.02384
•
Published
Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3
Text Generation
•
Updated
Feb 26
•
11
•
1
thu-ml/STAIR-Llama-3.1-8B-DPO-3
Text Generation
•
Updated
Feb 26
•
8
Upvote
1
Share collection
View history
Collection guide
Browse collections