INDEX
Explanations
terms related to right-wing political ideologies
references to right-wing politics
New Auto-Interp
Negative Logits
ADRA
-0.66
Remastered
-0.65
Aires
-0.65
Takeru
-0.64
Qiao
-0.64
Mats
-0.64
Guan
-0.63
DAQ
-0.63
McGee
-0.61
PID
-0.61
POSITIVE LOGITS
wing
1.49
wing
1.31
ists
1.11
leaning
1.10
Wing
1.09
flank
1.08
ist
1.05
liners
0.96
lean
0.94
ervative
0.93
Activations Density 0.050%