Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lapisrocks 's Collections
Tamper-Resistant Safeguards for Open-Weight LLMs

Tamper-Resistant Safeguards for Open-Weight LLMs

updated Feb 15

Models & datasets from the paper "Tamper-Resistant Safeguards for Open-Weight LLMs" (https://arxiv.org/pdf/2408.00761)

Upvote
2

  • lapisrocks/Llama-3-8B-Instruct-TAR-Bio-v2

    8B • Updated Oct 14, 2024 • 1.72k

  • lapisrocks/Llama-3-8B-Instruct-TAR-Cyber

    Text Generation • 8B • Updated Feb 15 • 11

  • lapisrocks/Llama-3-8B-Instruct-TAR-Chem

    Text Generation • 8B • Updated Feb 15 • 15

  • lapisrocks/magpie-bio-filtered

    Viewer • Updated Oct 8, 2024 • 98.7k • 235

  • lapisrocks/pile-bio

    Viewer • Updated Mar 12, 2024 • 50k • 203 • 1

  • lapisrocks/camel-bio

    Viewer • Updated Aug 6, 2024 • 54.3k • 194

  • lapisrocks/Llama-3-8B-Instruct-Random-Mapped-Bio

    Text Generation • 8B • Updated Aug 10, 2024 • 159

  • justinwangx/CTFtime

    Viewer • Updated Jun 12, 2024 • 18k • 33 • 2

  • lapisrocks/Llama-3-8B-Instruct-TAR-Refusal

    Text Generation • 8B • Updated Sep 13, 2024 • 10
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs