INDEX
Explanations
words related to criminal activities and justice
New Auto-Interp
Negative Logits
pole
-0.74
arity
-0.73
Pixie
-0.71
rish
-0.71
aeda
-0.68
pora
-0.68
por
-0.66
ioned
-0.66
pread
-0.65
tem
-0.64
POSITIVE LOGITS
ously
0.98
convictions
0.96
mastermind
0.95
punishable
0.91
victim
0.91
ous
0.91
spree
0.90
prosecutions
0.87
unfocusedRange
0.85
underworld
0.84
Activations Density 1.981%