INDEX
Explanations
references to political figures and their administrations
New Auto-Interp
Negative Logits
edly
-0.14
akeup
-0.14
AGO
-0.14
usu
-0.14
rances
-0.14
592
-0.14
itself
-0.14
rance
-0.13
æ°
-0.13
oner
-0.13
POSITIVE LOGITS
-era
0.28
omics
0.28
administration
0.27
Administration
0.27
ites
0.26
supporters
0.25
Era
0.24
ite
0.24
supporter
0.23
ista
0.23
Activations Density 0.095%