INDEX
Explanations
language related to social or political issues and statements condemning violence or unethical behavior
expressions of condemnation or denunciation regarding violence and human rights abuses
New Auto-Interp
Negative Logits
ortality
-0.82
©¶æ¥µ
-0.79
retirees
-0.76
luck
-0.74
igree
-0.73
Quartz
-0.71
soDeliveryDate
-0.70
Jennings
-0.69
Gins
-0.69
noticed
-0.69
POSITIVE LOGITS
hateful
1.35
intolerance
1.29
bigotry
1.27
discriminatory
1.27
unlawful
1.25
bullying
1.23
hatred
1.19
slander
1.18
malicious
1.18
misuse
1.17
Activations Density 0.503%