INDEX
Explanations
terms related to illegal activities, specifically trafficking
terms related to human trafficking and familial relationships
New Auto-Interp
Negative Logits
isure
-0.72
lihood
-0.71
Scotia
-0.67
vere
-0.65
OPLE
-0.65
Liberty
-0.64
Bie
-0.64
bris
-0.63
detachment
-0.63
ories
-0.62
POSITIVE LOGITS
traffickers
1.12
traff
1.09
pim
0.99
icked
0.89
exploited
0.84
abuser
0.83
appers
0.80
rapist
0.77
ynt
0.76
abusers
0.76
Activations Density 0.013%