INDEX
Explanations
instances of reported speech, particularly the word "said" and its variations
New Auto-Interp
Negative Logits
boycot
-0.80
nonviolent
-0.79
extremist
-0.77
satire
-0.76
extremists
-0.76
liberals
-0.71
cler
-0.70
bombard
-0.70
prosecutions
-0.69
rage
-0.67
POSITIVE LOGITS
":["
1.02
ascript
0.69
NBA
0.69
backstage
0.68
"â̦
0.66
Asked
0.66
æĪ¦
0.65
Regarding
0.65
ãĢĮ
0.65
yne
0.64
Activations Density 0.400%