INDEX
Explanations
references to drinking and consumption of alcohol
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.08
3:0.06
4:0.09
5:0.03
6:0.02
7:0.34
8:0.03
9:0.03
10:0.20
11:0.03
Negative Logits
traverse
-1.87
onomous
-1.86
geographically
-1.81
navigating
-1.77
navigate
-1.75
Galile
-1.72
navigation
-1.70
apter
-1.67
Route
-1.63
Filename
-1.62
POSITIVE LOGITS
champagne
2.02
discount
1.83
candles
1.82
Dollars
1.77
bottles
1.70
coupons
1.70
spiked
1.69
bol
1.69
tea
1.67
skept
1.67
Activations Density 0.000%