Explanations
mentions of illegal drugs
references to drugs
Negative Logits
caster    -0.77
Glacier   -0.74
estamp    -0.72
intosh    -0.71
Gir       -0.71
leigh     -0.70
ducks     -0.69
Upload    -0.67
ovsky     -0.67
angelo    -0.66
Positive Logits
drug       3.54
Drug       2.87
drug       2.87
drugs      2.72
Drug       2.72
Drugs      2.38
narcotics  2.18
cocaine    1.96
narc       1.95
heroin     1.95
Activations Density: 0.026%