Explanations
political and social controversy, particularly regarding racial and gender issues
Negative Logits
progressive     -0.18
Progressive     -0.16
queer           -0.16
osti            -0.15
akis            -0.15
oplast          -0.14
Ñģи             -0.14
Frid            -0.14
progressives    -0.14
oa              -0.14
Positive Logits
Natural         0.17
itbart          0.17
natural         0.16
liberty         0.16
Liberty         0.16
Natural         0.16
itage           0.14
Breitbart       0.14
Judge           0.14
.encoding       0.14
Activations Density 0.464%