INDEX
Explanations
mentions of coffee
mentions of coffee and related beverages
Negative Logits
etr          -0.94
yth          -0.84
Torn         -0.79
alties       -0.75
Mesh         -0.73
Surviv       -0.71
Div          -0.70
worthiness   -0.69
miss         -0.68
Gir          -0.68
Positive Logits
coffee       3.54
Coffee       2.71
espresso     2.41
coff         2.10
tea          1.98
caffeine     1.92
ffee         1.89
Starbucks    1.88
cocoa        1.85
caffe        1.78
Activations Density: 0.008%