INDEX
Explanations
references to specific food items or ingredients
New Auto-Interp
Negative Logits
ered
-0.15
roma
-0.15
Fahr
-0.14
мо
-0.14
ÑĢом
-0.14
meer
-0.14
reds
-0.14
Ed
-0.14
ATUS
-0.14
matt
-0.14
POSITIVE LOGITS
ome
0.27
ears
0.25
esto
0.23
anko
0.23
imiento
0.21
OME
0.21
ât
0.21
umper
0.20
ate
0.19
ita
0.19
Activations Density 0.009%