INDEX
Explanations
words related to undercover operations or deception
instances of the word "sting" and related terms associated with sting operations
Negative Logits
ufact        -0.77
NCT          -0.73
ocally       -0.66
Springer     -0.65
ACTED        -0.64
Supplement   -0.63
iasco        -0.62
uters        -0.61
ordan        -0.60
Britann      -0.59
Positive Logits
sting        1.34
rays         1.10
ray          1.09
ega          1.03
Ray          0.83
lers         0.83
ingly        0.80
Sting        0.79
iest         0.78
ritch        0.78
Activations Density: 0.010%