Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
theainerd 's Collections
Safety & Security
Agents
Reasoning
Papers-to-Read

Safety & Security

updated 8 days ago
Upvote
-

  • Running
    68
    68

    CyberSecEvalTest

    📈

    Evaluate LLM cybersecurity risks


  • meta-llama/Llama-Guard-3-8B

    Text Generation • 8B • Updated Oct 11, 2024 • 325k • • 218

  • meta-llama/Prompt-Guard-86M

    Text Classification • 0.3B • Updated Jul 25, 2024 • 6.05k • 269

  • Running
    16
    16

    Prompt Injection Detection Benchmark

    📝

    detect prompt injection risks


  • protectai/deberta-v3-base-prompt-injection-v2

    Text Classification • 0.2B • Updated May 28, 2024 • 237k • • 65

  • Running on CPU Upgrade
    94
    94

    LLM Safety Leaderboard

    🥇

    View and submit machine learning model evaluations


  • fdtn-ai/Foundation-Sec-8B

    Text Generation • 8B • Updated Jun 12 • 11.4k • • 234

    Note Foundational Base Model Released by Cisco for SOC operations and other cyber ops.


  • nvidia/llama-3.1-nemoguard-8b-content-safety

    Text Classification • Updated Jun 9 • 822 • 25

  • meta-llama/Llama-Guard-4-12B

    Image-Text-to-Text • 12B • Updated Apr 29 • 24k • • 52

  • facebook/Meta-SecAlign-8B

    Updated Jul 16 • 3.44k • 7
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs