INDEX
Explanations
keywords related to incitement or provocation
words related to incitement and provocation of violence or unrest
New Auto-Interp
Negative Logits
ellipt
-0.79
rieve
-0.70
neau
-0.69
rost
-0.66
otiation
-0.65
aird
-0.63
Garrison
-0.63
usterity
-0.61
Eucl
-0.61
erm
-0.61
POSITIVE LOGITS
inciting
1.01
sidx
1.01
itement
0.93
xual
0.88
incite
0.87
ãĥĥ
0.86
ãĥ¼ãĥĨãĤ£
0.81
ãĥŁ
0.78
TextColor
0.78
stoked
0.77
Activations Density 0.036%