Explanations
proper names, possibly associated with criminal activity
proper nouns, particularly names
Negative Logits
itarian     -0.68
corrid      -0.67
acious      -0.67
meric       -0.64
therape     -0.64
Moroc       -0.63
MAT         -0.60
ween        -0.60
unden       -0.59
mathemat    -0.58
Positive Logits
igham       0.88
igger       0.79
LLP         0.74
ucker       0.73
Township    0.70
umber       0.69
patrick     0.69
son         0.67
mare        0.67
Dent        0.66
Activations Density 0.125%
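
One way to read the density figure, assuming it is the fraction of sampled token positions on which the feature has a nonzero activation: 0.125% is about 1 token in 800. The sketch below only illustrates that arithmetic. The function name, the threshold, and the sample data are hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires;
    the dashboard's exact definition is not stated in the source.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 800 token positions, exactly one of which fires.
sample = [0.0] * 799 + [3.2]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.125%
```

A feature this sparse fires on very few tokens, which fits explanations that point to a narrow class of tokens such as proper names.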