INDEX
Explanations
references to food and related serving items
New Auto-Interp
Negative Logits
unas
-0.16
rix
-0.15
wo
-0.15
ode
-0.15
endon
-0.14
stars
-0.14
sov
-0.14
stellar
-0.14
ufen
-0.14
Atlas
-0.14
POSITIVE LOGITS
desar
0.18
.localized
0.16
æ¡Į
0.15
abinet
0.15
ÑĥлÑĭ
0.15
Colts
0.14
kea
0.14
eza
0.14
æĶ¾åľ¨
0.14
æŀ¶
0.14
Activations Density 0.113%