INDEX
Explanations
words related to illegal activities, specifically smuggling and contraband
references to contraband and related illegal activities
Negative Logits
Token         Logit
tes           -0.85
#$            -0.83
eer           -0.80
Communities   -0.77
STATE         -0.74
BRE           -0.72
lishing       -0.70
town          -0.69
bra           -0.68
oğ            -0.67
Positive Logits
Token         Logit
aband         0.88
arians        0.88
iquette       0.82
ribut         0.81
oxin          0.74
iary          0.73
uner          0.73
iger          0.72
illon         0.70
ihar          0.69
Activations Density: 0.046%