INDEX
Explanations
phrases related to political controversy and conflict
references to political conflict and personal attacks
New Auto-Interp
Negative Logits
BuyableInstoreAndOnline
-0.78
emonium
-0.73
Nanto
-0.66
iHUD
-0.65
ARB
-0.62
ITNESS
-0.61
juggling
-0.61
shuffle
-0.61
Boom
-0.60
uned
-0.59
POSITIVE LOGITS
indiscrim
0.96
unfairly
0.93
unnecessarily
0.93
lest
0.89
because
0.89
unjust
0.87
outright
0.85
selves
0.83
by
0.83
innoc
0.83
Activations Density 0.370%