INDEX
Explanations
mentions of a specific person referred to as "Her"
occurrences of the pronoun "Her"
New Auto-Interp
Negative Logits
otation
-0.64
orientation
-0.63
ynski
-0.61
ype
-0.60
lockout
-0.58
uto
-0.57
CVE
-0.56
uate
-0.54
peak
-0.54
amping
-0.54
POSITIVE LOGITS
Her
3.46
Her
2.65
She
2.09
HER
1.97
She
1.62
her
1.54
her
1.50
herself
1.49
His
1.41
she
1.35
Activations Density 0.009%