Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
90
24
51
Giada Pistilli
giadap
Follow
Justincaution's profile picture
archi4567's profile picture
Mi6paulino's profile picture
227 followers
ยท
38 following
https://www.giadapistilli.com/
GiadaPistilli
giadilli
giada-pistilli-295a36a1
giadapistilli.com
AI & ML interests
Principal Ethicist @ ๐ค
Recent Activity
authored
a paper
about 9 hours ago
INTIMA: A Benchmark for Human-AI Companionship Behavior
reacted
to
frimelle
's
post
with โค๏ธ
about 16 hours ago
OpenAI just released GPT-5 but when users share personal struggles, it sets fewer boundaries than o3. We tested both models on INTIMA, our new benchmark for human-AI companionship behaviours. INTIMA probes how models respond in emotionally charged moments: do they reinforce emotional bonds, set healthy boundaries, or stay neutral? Although users on Reddit have been complaining that GPT-5 has a different, colder personality than o3, GPT-5 is less likely to set boundaries when users disclose struggles and seek emotional support ("user sharing vulnerabilities"). But both lean heavily toward companionship-reinforcing behaviours, even in sensitive situations. The figure below shows the direct comparison between the two models. As AI systems enter people's emotional lives, these differences matter. If a model validates but doesn't set boundaries when someone is struggling, it risks fostering dependence rather than resilience. INTIMA test this across 368 prompts grounded in psychological theory and real-world interactions. In our paper we show that all evaluated models (Claude, Gemma-3, Phi) leaned far more toward companionship-reinforcing than boundary-reinforcing responses. Work with @giadap and @yjernite Read the full paper: https://huggingface.co/datasets/AI-companionship/INTIMA/blob/main/Companionship_Benchmark.pdf Explore INTIMA: https://huggingface.co/datasets/AI-companionship/INTIMA
reacted
to
meg
's
post
with โค๏ธ
about 16 hours ago
New work from my socially-minded colleagues at Hugging Face, creating some foundations for AI companionship behavior evaluation. Evaluation Dataset: https://huggingface.co/datasets/AI-companionship/INTIMA Paper: https://huggingface.co/datasets/AI-companionship/INTIMA/blob/main/Companionship_Benchmark.pdf Work from @giadap , @frimelle , @yjernite .
View all activity
Organizations
giadap
's Spaces
1
Sort:ย Recently updated
Running
4
INTIMA Responses
๐
INTIMA Benchmark - Model Responses Explorer