INDEX
Explanations
terms related to discrimination and extremism
negative sentiments related to various forms of discrimination and extremism
New Auto-Interp
Negative Logits
laure
-0.74
ĸļ
-0.73
precincts
-0.73
Anniversary
-0.71
curled
-0.70
scratch
-0.69
fold
-0.69
thumbnail
-0.69
Brus
-0.68
somew
-0.68
POSITIVE LOGITS
Semitic
1.68
Semitism
1.61
establishment
1.54
government
1.50
immigrant
1.49
democratic
1.45
social
1.44
capitalist
1.44
choice
1.39
immigration
1.35
Activations Density 0.022%