INDEX
Explanations
words related to conflicts or standoffs
terms related to violent confrontations or conflicts
Negative Logits
odor      -0.84
urers     -0.70
uries     -0.69
ringe     -0.68
oral      -0.68
incarn    -0.66
yi        -0.66
rates     -0.66
ools      -0.65
broad     -0.65
Positive Logits
standoff   1.32
SWAT       0.82
Bundy      0.72
halla      0.71
Shooter    0.71
selfies    0.66
saga       0.66
roy        0.66
emate      0.66
ishly      0.65
Activations Density: 0.033%