INDEX
Explanations
references to alcohol consumption and its effects on behavior
New Auto-Interp
Negative Logits
Moisture
-0.50
Viitteet
-0.49
ورك
-0.48
inchen
-0.47
snippetHide
-0.46
ündig
-0.44
tro
-0.43
كره
-0.43
yenne
-0.43
లాలు
-0.43
POSITIVE LOGITS
drunk
0.98
drunken
0.92
alcoholic
0.91
alcohol
0.90
alkoh
0.89
drunk
0.87
Drunk
0.86
booze
0.83
alcohol
0.82
Alcohol
0.81
Activations Density 0.195%