INDEX
Explanations
terms associated with political ideologies, particularly those relating to socialism and conservatism
New Auto-Interp
Negative Logits
iless
-0.17
pez
-0.15
USR
-0.15
warz
-0.15
elon
-0.14
scape
-0.14
358
-0.14
LESS
-0.14
meno
-0.14
igit
-0.14
POSITIVE LOGITS
-leaning
0.39
lean
0.32
leaning
0.32
leaning
0.30
tendencies
0.26
sentiments
0.23
sentiment
0.23
-minded
0.23
/left
0.23
sympath
0.23
Activations Density 0.113%