INDEX
Explanations
references to cocktails and mixed drinks
New Auto-Interp
Negative Logits
lh
-0.16
inka
-0.16
gore
-0.15
InstantiationException
-0.15
Locker
-0.15
jed
-0.15
oins
-0.14
potatoes
-0.14
durum
-0.14
Brewery
-0.14
POSITIVE LOGITS
ice
0.35
Ice
0.31
Ice
0.27
ICE
0.26
ice
0.24
ICE
0.22
åĨ°
0.19
garn
0.19
iceberg
0.17
gren
0.17
Activations Density 0.056%