INDEX
Explanations
references to alcoholic beverages, specifically beer and wine
New Auto-Interp
Negative Logits
AssemblyCulture
-0.64
]")]
-0.56
arXiv
-0.56
Reverso
-0.54
AttributeSet
-0.50
AntiForgeryToken
-0.50
strated
-0.50
umper
-0.49
rodo
-0.48
ulado
-0.46
POSITIVE LOGITS
bottles
0.86
beverage
0.84
beverages
0.82
drinkers
0.80
bottle
0.78
BOTT
0.76
Drinking
0.74
drinker
0.73
drinking
0.73
Bottles
0.73
Activations Density 0.064%