INDEX
Explanations
phrases or words related to controversial remarks or statements
comments and discussions about political figures and their controversial statements
New Auto-Interp
Negative Logits
neau
-0.80
inct
-0.78
fen
-0.71
Sensor
-0.71
ept
-0.69
lite
-0.69
bda
-0.68
eworks
-0.68
CLSID
-0.68
aic
-0.67
POSITIVE LOGITS
Mexicans
0.99
himself
0.99
homosexuals
0.93
gays
0.91
insulting
0.89
homosexuality
0.87
dispar
0.83
Adolf
0.81
apologizing
0.79
Muslims
0.78
Activations Density 0.417%