INDEX
Explanations
words related to police, government, corruption, and investigations
New Auto-Interp
Negative Logits
cients
-0.70
soever
-0.67
issors
-0.65
worker
-0.61
sson
-0.60
Gum
-0.57
Dro
-0.57
ktop
-0.56
Kenobi
-0.56
Unlimited
-0.56
POSITIVE LOGITS
intensity
0.97
intensity
0.90
priority
0.90
ante
0.88
levels
0.87
profile
0.85
sophistication
0.83
Grade
0.82
stakes
0.81
ensity
0.80
Activations Density 1.087%