Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
shensq0814 's Collections
Commonsense
Alignment
Prompting
LLM
Personalization

Alignment

updated Nov 16, 2023
Upvote
-

  • MART: Improving LLM Safety with Multi-round Automatic Red-Teaming

    Paper • 2311.07689 • Published Nov 13, 2023 • 9

  • Trusted Source Alignment in Large Language Models

    Paper • 2311.06697 • Published Nov 12, 2023 • 12

  • Unveiling Safety Vulnerabilities of Large Language Models

    Paper • 2311.04124 • Published Nov 7, 2023 • 10
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs