INDEX
Explanations
references to cocktails and drinks
New Auto-Interp
Negative Logits
%p
-0.15
GOODMAN
-0.15
ÎŃλ
-0.15
cooker
-0.14
_USERNAME
-0.14
.Context
-0.14
lander
-0.13
bake
-0.13
.Transactional
-0.13
breadcrumb
-0.13
POSITIVE LOGITS
served
0.17
-serving
0.16
ìŀĶ
0.15
jing
0.15
glasses
0.15
473
0.15
deniz
0.14
uis
0.14
serve
0.14
bartender
0.14
Activations Density 0.037%