Palisade Research (Collaborators)

Team

non-profit

Verified

https://palisaderesearch.org

palisadeai

PalisadeResearch

AI & ML interests

None defined yet.

Recent Activity

lindapetrini authored a paper about 1 month ago

Inverse Scaling in Test-Time Compute

Reworr-R updated a dataset 6 months ago

palisaderesearch/LLM-Honeypot-Logs

dmitrii-palisaderesearch authored a paper 11 months ago

Badllama 3: removing safety finetuning from Llama 3 in minutes

View all activity

lindapetrini

authored a paper about 1 month ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19 • 27

Reworr-R

updated a dataset 6 months ago

palisaderesearch/LLM-Honeypot-Logs

Updated about 6 hours ago • 39 • 4

dmitrii-palisaderesearch

authored a paper 11 months ago

Badllama 3: removing safety finetuning from Llama 3 in minutes

Paper • 2407.01376 • Published Jul 1, 2024

AdamGleave

authored a paper over 1 year ago

Exploiting Novel GPT-4 APIs

Paper • 2312.14302 • Published Dec 21, 2023 • 14

jladish

authored a paper almost 2 years ago

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Paper • 2310.20624 • Published Oct 31, 2023 • 13

oserikov

authored a paper almost 2 years ago

The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute

Paper • 2309.11197 • Published Sep 20, 2023 • 5

AdamGleave

authored 2 papers about 2 years ago

Adversarial Policies Beat Superhuman Go AIs

Paper • 2211.00241 • Published Nov 1, 2022

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Paper • 2203.07475 • Published Mar 14, 2022