INDEX
Explanations
references to leadership and political identity
New Auto-Interp
Negative Logits
å°ĸ
-0.14
536
-0.14
zier
-0.14
eka
-0.14
ÑĥÑī
-0.14
ãĤ¤ãĤº
-0.14
anarchists
-0.14
itung
-0.14
å¢ĥ
-0.14
mploy
-0.14
POSITIVE LOGITS
states
0.23
figure
0.21
moderate
0.21
cent
0.19
outsider
0.19
states
0.19
cerebral
0.19
inexperienced
0.18
States
0.18
/pop
0.17
Activations Density 0.162%