INDEX
Explanations
words related to political ideologies, particularly forms of socialism and anarchism
New Auto-Interp
Negative Logits
McDon
-0.70
unca
-0.61
InputDecoration
-0.61
e
-0.57
ANCA
-0.55
Huck
-0.55
----</
-0.55
AJAS
-0.54
eX
-0.54
Autoritní
-0.53
POSITIVE LOGITS
ist
2.92
IST
2.50
ists
2.46
ISTS
2.24
tist
1.74
rist
1.60
alist
1.57
istes
1.51
istin
1.51
cist
1.51
Activations Density 0.101%