INDEX
Explanations
references to former political figures and officials
New Auto-Interp
Negative Logits
former
-0.20
former
-0.18
Former
-0.18
formerly
-0.17
Former
-0.17
缮åīį
-0.16
/he
-0.16
yy
-0.15
older
-0.15
uck
-0.15
POSITIVE LOGITS
/current
0.34
/original
0.19
odus
0.19
Yugoslavia
0.18
/new
0.18
ly
0.17
employees
0.16
asper
0.16
LY
0.16
ucha
0.16
Activations Density 0.048%