INDEX
Explanations
mentions or references to individuals with 'ex' in front of their title or position
references to former or past individuals in various contexts
New Auto-Interp
Negative Logits
Zup
-0.77
shading
-0.73
uden
-0.73
kson
-0.73
eday
-0.72
otle
-0.71
intervals
-0.69
dances
-0.68
utical
-0.65
atche
-0.63
POSITIVE LOGITS
girlfriend
1.13
husband
1.04
imposed
0.99
member
0.96
president
0.93
turned
0.92
employ
0.92
commun
0.91
vict
0.90
may
0.90
Activations Density 0.024%