Explanations
names or descriptions related to cocktails
references to cocktails and mixed drinks
Negative Logits
Token       Logit
ħ           -0.88
Prev        -0.85
ership      -0.73
Prev        -0.70
teness      -0.69
uthor       -0.67
nces        -0.66
ths         -0.65
Kun         -0.65
Whe         -0.65
Positive Logits
Token       Logit
cocktail    3.83
cocktails   3.28
Cock        1.62
bartender   1.56
gin         1.55
vodka       1.53
bart        1.52
drinks      1.43
conco       1.42
brunch      1.40
Activations Density 0.017%
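
The density figure above can be read as the share of sampled token positions on which the feature is active. The sketch below illustrates that reading only. It assumes "active" means an activation strictly greater than zero, which the page does not state, and the function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of positions whose activation exceeds `threshold`.

    Assumption: a position counts as active when its activation is
    strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: 17 active positions out of 100,000 tokens.
sample = [1.0] * 17 + [0.0] * 99_983
print(f"{activation_density(sample):.3f}%")  # prints "0.017%"
```

Under this reading, a density of 0.017% corresponds to roughly 17 active positions per 100,000 sampled tokens.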