Explanations
terms related to extremist groups and ideologies, including white supremacists, neo-Nazis, and fascist movements
Negative Logits
Token                  Logit
anwhile                -0.64
Rica                   -0.59
Tiff                   -0.56
AVG                    -0.56
++++++++++++++++       -0.54
ovember                -0.53
pload                  -0.52
pport                  -0.51
Effective              -0.51
channelAvailability    -0.50

Positive Logits
Token                  Logit
fascist                 0.74
colonial                0.72
liberal                 0.71
social                  0.67
present                 0.67
Nazi                    0.67
natal                   0.66
aligned                 0.64
capitalist              0.64
Georg                   0.63

Activations Density: 8.266%