INDEX
Explanations
references to ordering and menu items in food-related contexts
New Auto-Interp
Negative Logits
I
-0.59
P
-0.55
H
-0.53
T
-0.53
F
-0.52
A
-0.51
,
-0.50
B
-0.50
V
-0.49
<eos>
-0.49
POSITIVE LOGITS
Anſ
1.05
Theſe
1.04
purpoſe
1.04
CreateTagHelper
1.02
Efq
1.02
myſelf
1.00
Eſ
0.98
leaſt
0.97
ConstraintMaker
0.96
Monfieur
0.96
Activations Density 0.331%