Ben Hoover

bhoov

https://www.bhoov.com/

AI & ML interests

Interpretability, NLP, Hopfield Nets

Organizations

upvoted a collection 2 months ago

Granite 4.0 Nano Language Models

9 items • Updated Nov 17, 2025 • 93

upvoted a paper 3 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1, 2025 • 58

upvoted a paper 7 months ago

Effective Red-Teaming of Policy-Adherent Agents

Paper • 2506.09600 • Published Jun 11, 2025 • 39

upvoted a paper 11 months ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6, 2025 • 36