Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
90
24
51
Giada Pistilli
giadap
Follow
Righteous-11's profile picture
allendorf's profile picture
de-Rodrigo's profile picture
227 followers
ยท
38 following
https://www.giadapistilli.com/
GiadaPistilli
giadilli
giada-pistilli-295a36a1
giadapistilli.com
AI & ML interests
Principal Ethicist @ ๐ค
Recent Activity
authored
a paper
about 9 hours ago
INTIMA: A Benchmark for Human-AI Companionship Behavior
reacted
to
frimelle
's
post
with โค๏ธ
about 16 hours ago
OpenAI just released GPT-5 but when users share personal struggles, it sets fewer boundaries than o3. We tested both models on INTIMA, our new benchmark for human-AI companionship behaviours. INTIMA probes how models respond in emotionally charged moments: do they reinforce emotional bonds, set healthy boundaries, or stay neutral? Although users on Reddit have been complaining that GPT-5 has a different, colder personality than o3, GPT-5 is less likely to set boundaries when users disclose struggles and seek emotional support ("user sharing vulnerabilities"). But both lean heavily toward companionship-reinforcing behaviours, even in sensitive situations. The figure below shows the direct comparison between the two models. As AI systems enter people's emotional lives, these differences matter. If a model validates but doesn't set boundaries when someone is struggling, it risks fostering dependence rather than resilience. INTIMA test this across 368 prompts grounded in psychological theory and real-world interactions. In our paper we show that all evaluated models (Claude, Gemma-3, Phi) leaned far more toward companionship-reinforcing than boundary-reinforcing responses. Work with @giadap and @yjernite Read the full paper: https://huggingface.co/datasets/AI-companionship/INTIMA/blob/main/Companionship_Benchmark.pdf Explore INTIMA: https://huggingface.co/datasets/AI-companionship/INTIMA
reacted
to
meg
's
post
with โค๏ธ
about 16 hours ago
New work from my socially-minded colleagues at Hugging Face, creating some foundations for AI companionship behavior evaluation. Evaluation Dataset: https://huggingface.co/datasets/AI-companionship/INTIMA Paper: https://huggingface.co/datasets/AI-companionship/INTIMA/blob/main/Companionship_Benchmark.pdf Work from @giadap , @frimelle , @yjernite .
View all activity
Organizations
giadap
's datasets
None public yet