INDEX
Explanations
mentions of consuming a beverage, specifically drinking wine
references to drinking beverages, particularly alcohol
Negative Logits
pora       -0.72
eele       -0.72
ural       -0.71
theless    -0.70
orp        -0.70
moving     -0.68
Sharif     -0.66
eq         -0.65
sure       -0.64
asia       -0.63
Positive Logits
bottles      1.00
alcohol      0.96
drinking     0.96
cohol        0.91
gallons      0.91
beverages    0.90
water        0.89
soda         0.88
Drink        0.87
glasses      0.87
Activations Density: 0.020%