INDEX
Explanations
references to political ideologies or movements associated with the term "neo."
references to neo-Nazi ideology and groups
Negative Logits
ILCS                  -0.89
ORED                  -0.80
ゼウス                -0.79
Interstitial          -0.79
loo                   -0.79
hips                  -0.79
worthiness            -0.77
channelAvailability   -0.77
lessly                -0.73
ULL                   -0.72
Positive Logits
ge                    0.95
neo                   0.76
flex                  0.75
emer                  0.73
-                     0.73
azi                   0.73
fer                   0.73
chal                  0.71
mascul                0.70
fascist               0.69
Activations Density 0.009%
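
Token strings in logit lists like the ones above are sometimes displayed in the printable alphabet of a byte-level BPE vocabulary rather than as decoded text; the string ãĤ¼ãĤ¦ãĤ¹ is such a rendering of ゼウス. The sketch below shows one way to reverse that rendering. It assumes a GPT-2 style byte-to-character table, which the page itself does not confirm, and the names `byte_to_char_table` and `decode_token` are hypothetical helpers introduced only for illustration.

```python
def byte_to_char_table():
    """Build a GPT-2 style map from each byte value to a printable character.

    Assumption: the tokenizer uses the GPT-2 byte-level BPE alphabet.
    """
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is assigned the next code point from 256 upward.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: printable character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in byte_to_char_table().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level token string back into UTF-8 text (hypothetical helper)."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # errors="replace" keeps partial multi-byte tokens from raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(decode_token("\u00e3\u0124\u00bc\u00e3\u0124\u00a6\u00e3\u0124\u00b9"))  # ゼウス
```

Plain ASCII tokens such as `neo` or `fascist` pass through unchanged, so the helper can be applied to every row of a list without special cases.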