INDEX
Explanations
references to specific roles or identities of individuals within a narrative context
New Auto-Interp
Negative Logits
próp
-0.23
own
-0.22
same
-0.22
propio
-0.18
mesma
-0.18
Own
-0.18
propia
-0.16
èĩªå·±çļĦ
-0.16
Own
-0.16
stesso
-0.16
POSITIVE LOGITS
es
0.21
idade
0.20
dess
0.18
ha
0.17
ión
0.17
as
0.17
ed
0.15
chaft
0.15
ivalence
0.15
os
0.14
Activations Density 0.032%