Explanations
references to individuals in a possessive context
Negative Logits
Token        Logit
cie          -0.15
dow          -0.14
orsche       -0.14
levant       -0.14
=context     -0.13
horizontal   -0.13
vas          -0.13
cou          -0.13
.metro       -0.13
Jenkins      -0.13
Positive Logits
Token        Logit
/her         0.37
own          0.26
/she         0.23
panic        0.22
own          0.19
próp         0.18
zelf         0.17
maal         0.16
editary      0.16
ewith        0.16
Activations Density: 0.223%