INDEX
Explanations
words related to police operations or criminal activities
references to sting operations in law enforcement contexts
New Auto-Interp
Negative Logits
ufact
-0.81
Supplement
-0.69
iasco
-0.67
ordan
-0.64
ocally
-0.63
Princ
-0.62
NCT
-0.61
ACTED
-0.61
uclear
-0.60
uters
-0.59
POSITIVE LOGITS
sting
1.36
rays
1.05
ray
1.03
ega
0.91
Sting
0.86
Ray
0.82
tip
0.78
elope
0.78
ingly
0.78
iness
0.75
Activations Density 0.005%