Explanations
phrases related to food and cooking
Negative Logits
sugar      -0.16
масло      -0.15
sugars     -0.15
Baker      -0.15
lobal      -0.15
tica       -0.15
住         -0.15
meisjes    -0.14
rán        -0.14
dess       -0.14
Positive Logits
soup       0.53
Soup       0.47
soup       0.43
Soup       0.41
broth      0.41
sou        0.39
Sou        0.34
湯         0.32
_soup      0.31
Sou        0.31
Activations Density 0.081%