INDEX
Explanations
references to drinks, specifically alcoholic beverages
terms related to drinking water
NEGATIVE LOGITS
Token         Logit
Sharif        -0.86
notch         -0.73
cephal        -0.65
Parenthood    -0.63
asia          -0.61
router        -0.60
Postal        -0.60
Payton        -0.60
ansion        -0.60
":["          -0.59
POSITIVE LOGITS
Token         Logit
cohol         1.22
water         1.17
alcohol       1.13
bottles       1.12
beverages     1.07
brewed        1.05
drinking      1.04
drinkers      1.03
beverage      0.99
water         0.99
Activations Density: 0.053%