INDEX
Explanations
phrases related to political affiliations, specifically referring to right-wing ideologies
references to "right-wing" political ideologies or groups
New Auto-Interp
Negative Logits
TTL
-0.67
BILITIES
-0.67
WARRANT
-0.64
Interstitial
-0.61
æĹ
-0.59
Sharing
-0.58
FULL
-0.57
possible
-0.57
Sending
-0.57
CODE
-0.56
POSITIVE LOGITS
wing
1.37
tip
1.08
nuts
0.89
een
0.86
nut
0.85
tips
0.83
edly
0.83
wich
0.82
ucer
0.80
ritch
0.78
Activations Density 0.011%