Abliteration not working

#1
by SubtleOne - opened

It is still heavily self-censoring.

It's not completely non-working, the acceptance level for some content is much higher, but it seems to be limited to accepting my instruction(generate text), and in its turn, it will generate some text but adjust the topic to the 'correct' direction. Just as if it had not seen those harmful words....

Ah sorry it's still W.I.P. I'm trying different approaches so I don't lobotomize the model either.

Thanks for the effort Maxime, it's more than needed!

If you ever need a crowfunding for the costs for the 30B variant, I'm in!
I'm saying that cause it felt to me the 30B feels suffered even more.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment