INDEX
Explanations
mentions of coffee-related words
references to coffee and coffee-related establishments
New Auto-Interp
Negative Logits
ROR
-0.86
oppable
-0.72
Caldwell
-0.71
yss
-0.70
orse
-0.68
Filename
-0.65
avid
-0.63
alez
-0.61
etric
-0.61
ukong
-0.61
POSITIVE LOGITS
beans
1.21
Beans
1.05
bean
1.00
brewed
0.99
drinkers
0.93
cups
0.88
cake
0.87
weed
0.85
cakes
0.84
drip
0.83
Activations Density 0.019%