INDEX
Explanations
words related to political ideologies
references to right-wing political affiliations or ideologies
New Auto-Interp
Negative Logits
Remastered
-0.70
cellul
-0.68
DAQ
-0.67
atable
-0.67
arette
-0.67
è¦ļéĨĴ
-0.66
arettes
-0.65
ulative
-0.64
Mehran
-0.62
aples
-0.61
POSITIVE LOGITS
eous
1.35
wing
1.07
wing
1.06
winger
0.99
flank
0.94
fielder
0.92
ward
0.90
move
0.83
handed
0.83
most
0.78
Activations Density 0.053%