Explanations
instructions for preparing or cooking food
Negative Logits
Token      Logit
upaya      -0.47
ine        -0.46
i          -0.45
efforts    -0.44
<eos>      -0.43
I          -0.42
for        -0.40
hat        -0.39
(blank)    -0.39
,          -0.38
Positive Logits
Token      Logit
AndEndTag  1.14
Efq        1.10
myſelf     1.08
Theſe      1.04
itſelf     1.03
Jefus      1.03
ſeveral    1.02
SBATCH     1.02
himſelf    0.99
pleaſure   0.97
Activations Density: 0.302%