INDEX
Explanations
references to human trafficking and exploitation
Negative Logits
ully      -0.16
orges     -0.16
rief      -0.16
inox      -0.15
archy     -0.15
Fleet     -0.15
olt       -0.15
stab      -0.14
toJSON    -0.14
ض         -0.14
Positive Logits
trafficking    0.26
traff          0.25
Traff          0.23
slavery        0.19
bonded         0.19
bondage        0.19
human          0.19
sex            0.18
child          0.18
traf           0.18
Activations Density 0.020%
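
The page does not define the density figure. As a minimal sketch, assume it is the fraction of sampled token positions at which the feature's activation is above zero. Under that assumption, 0.020% corresponds to roughly 2 active positions per 10,000 tokens. The function name `activation_density` and the sample data below are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means (active positions) / (all sampled positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 2 of which activate.
sample = [0.0] * 9998 + [1.7, 0.4]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would indicate a sparse feature that fires on a small, specific set of tokens, which fits the narrow topic given in the explanation.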