INDEX
Explanations
mentions of alcohol consumption and its implications
New Auto-Interp
Negative Logits
feeder
-0.16
olini
-0.16
Breakfast
-0.15
hungry
-0.15
Candy
-0.15
bake
-0.15
Bath
-0.15
Bake
-0.14
Soap
-0.14
Chocolate
-0.14
POSITIVE LOGITS
alcohol
0.49
éħĴ
0.46
alcoholic
0.43
Alcohol
0.42
drink
0.41
алког
0.39
rượu
0.38
booze
0.38
beer
0.37
cohol
0.37
Activations Density 0.512%