INDEX
Explanations
alcohol-related words, particularly focusing on specific types such as vodka, tequila, gin, and bourbon
mentions of alcoholic beverages
New Auto-Interp
Negative Logits
Tracking
-0.72
Printed
-0.69
Parenthood
-0.68
Pages
-0.67
Payton
-0.67
NCT
-0.66
Defin
-0.64
Tur
-0.64
Behavior
-0.63
Postal
-0.63
POSITIVE LOGITS
vodka
1.17
whisky
1.09
gin
1.08
odka
1.04
whiskey
1.03
quila
0.91
wine
0.91
cocktails
0.89
vinegar
0.89
cocktail
0.86
Activations Density 0.016%