INDEX
Explanations
ingredients or cooking instructions related to a specific recipe
New Auto-Interp
Negative Logits
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
-0.66
hips
-0.65
hart
-0.65
bolt
-0.64
ambling
-0.63
Mobility
-0.63
iosity
-0.62
beit
-0.62
Gry
-0.60
inez
-0.60
POSITIVE LOGITS
urized
1.58
ur
1.16
paste
0.96
ures
0.90
ured
0.85
URES
0.84
ure
0.83
uring
0.82
uri
0.81
urs
0.78
Activations Density 0.037%