Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bartosz Cywiński's picture
3 9 25

Bartosz Cywiński

bcywinski
Marek4321's profile picture aszokalski's profile picture ssaha3umbc's profile picture
·
https://cywinski.github.io/
  • bartoszcyw
  • cywinski

AI & ML interests

Mechanistic Interpretability

Recent Activity

updated a model 4 days ago
bcywinski/gemma-2-9b-it-occupation-doctor
published a model 4 days ago
bcywinski/gemma-2-9b-it-occupation-doctor
updated a model 4 days ago
bcywinski/llama-3.1-8b-instruct-taboo-blue
View all activity

Organizations

None yet

authored a paper 3 months ago

Eliciting Secret Knowledge from Language Models

Paper • 2510.01070 • Published Oct 1 • 5
authored a paper 7 months ago

Towards eliciting latent knowledge from LLMs with mechanistic interpretability

Paper • 2505.14352 • Published May 20 • 9
authored a paper 10 months ago

Precise Parameter Localization for Textual Generation in Diffusion Models

Paper • 2502.09935 • Published Feb 14 • 12
authored a paper 11 months ago

SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders

Paper • 2501.18052 • Published Jan 29 • 8
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs