INDEX
Explanations
elements and descriptions related to food dishes
Negative Logits
Token      Logit
Baker      -0.16
meisjes    -0.16
snack      -0.16
Bakery     -0.15
lobal      -0.15
tica       -0.15
Cake       -0.14
baker      -0.14
bakery     -0.14
ä½ı        -0.14
Positive Logits
Token      Logit
soup       0.59
Soup       0.54
sou        0.50
soup       0.49
Soup       0.45
broth      0.44
Sou        0.44
Sou        0.37
oup        0.36
_soup      0.35
Activations Density 0.093%