INDEX
Explanations
phrases related to drinking activities
New Auto-Interp
Negative Logits
Sharif
-0.77
ural
-0.75
notch
-0.68
theless
-0.66
ominated
-0.65
Warfare
-0.64
asia
-0.62
Tur
-0.61
router
-0.61
Postal
-0.61
POSITIVE LOGITS
cohol
1.09
bottles
1.08
alcohol
1.04
water
1.03
drinking
0.99
bott
0.97
beverages
0.96
drinkers
0.92
bottle
0.92
gallons
0.92
Activations Density 0.084%