INDEX
Explanations
the mention of alcoholic beverages in the form of bottles
references to bottles, particularly in the context of beverages
New Auto-Interp
Negative Logits
merce
-0.87
doms
-0.78
urities
-0.73
uli
-0.69
yrinth
-0.69
IFA
-0.68
tale
-0.67
ahime
-0.65
adesh
-0.63
undai
-0.63
POSITIVE LOGITS
bottles
1.17
bottle
1.15
Bottle
0.99
Bott
0.93
Bott
0.89
opener
0.89
mark
0.82
labelled
0.82
refill
0.80
vodka
0.80
Activations Density 0.027%