INDEX
Explanations
occurrences of specific words related to food and recipes
New Auto-Interp
Negative Logits
v
-0.19
r
-0.19
res
-0.18
d
-0.17
aa
-0.17
amm
-0.17
ings
-0.17
ritt
-0.17
ri
-0.17
rr
-0.17
POSITIVE LOGITS
ments
0.20
ngine
0.20
ddie
0.19
cht
0.18
chts
0.18
xp
0.18
ngo
0.17
uther
0.17
mployee
0.17
uil
0.17
Activations Density 0.047%