INDEX
Explanations
verbs related to threatening actions
phrases indicating threats or aggressive actions
New Auto-Interp
Negative Logits
Parables
-0.77
guiName
-0.67
CTV
-0.66
Display
-0.65
oret
-0.65
ellen
-0.63
worn
-0.62
Bought
-0.62
Videos
-0.61
incorporated
-0.60
POSITIVE LOGITS
retaliation
1.03
retribution
0.99
repr
0.84
retaliate
0.82
boycott
0.78
harming
0.77
eviction
0.74
annihilation
0.74
intimidation
0.73
wrath
0.73
Activations Density 0.104%