INDEX
Explanations
instructions related to cooking or food preparation
New Auto-Interp
Negative Logits
åĽ
-0.15
atri
-0.15
lamaz
-0.15
onis
-0.14
anford
-0.14
Manip
-0.13
damp
-0.13
pás
-0.13
blame
-0.13
Baker
-0.13
POSITIVE LOGITS
boil
0.40
boils
0.31
boiling
0.31
-bo
0.29
Bo
0.29
simmer
0.28
bo
0.26
_bo
0.23
boilers
0.23
boiled
0.23
Activations Density 0.023%