INDEX
Explanations
terms related to police tactical units, specifically the term "SWAT" along with words related to restraining actions
references to law enforcement, particularly relating to SWAT teams and their actions
New Auto-Interp
Negative Logits
士
-0.93
Minotaur
-0.66
omission
-0.60
Daughter
-0.60
infeld
-0.60
antidepressants
-0.58
fixation
-0.58
Reviewer
-0.57
Ô
-0.57
righteous
-0.57
POSITIVE LOGITS
swat
1.27
ting
1.14
eenth
0.92
efully
0.89
estic
0.89
apon
0.85
stakes
0.84
apons
0.83
hes
0.83
tle
0.81
Activations Density 0.007%