INDEX
Explanations
words related to political and social ideologies
terms related to various political ideologies
Negative Logits
tesy          -0.78
ecause        -0.74
accompanied   -0.71
shown         -0.67
speedy        -0.67
FACE          -0.65
ilings        -0.64
ruciating     -0.63
upon          -0.63
perty         -0.63
Positive Logits
geist         0.89
rist          0.88
ist           0.87
ess           0.83
oad           0.81
opol          0.80
ists          0.79
essed         0.77
tendencies    0.76
otle          0.76
Activations Density: 0.028%