Explanations
references to drug use and addiction
Negative Logits
drugs        -0.23
alcohol      -0.22
booze        -0.20
antibiotics  -0.19
drunken      -0.19
drug         -0.18
drinking     -0.18
Drugs        -0.17
alcoholic    -0.17
whiskey      -0.17
Positive Logits
purity        0.20
powder        0.18
slang         0.18
recre         0.17
paraph        0.17
street        0.17
street        0.17
nasal         0.17
Powder        0.16
recreational  0.16
Activations Density: 0.053%