INDEX
Explanations
terms related to food and dining experiences
New Auto-Interp
Negative Logits
Soup
-0.20
spaghetti
-0.19
Salad
-0.18
éħĴ
-0.18
soup
-0.18
wine
-0.17
soup
-0.17
beers
-0.16
dinner
-0.16
Wine
-0.15
POSITIVE LOGITS
dough
0.40
Dough
0.37
pastry
0.32
bakery
0.30
muff
0.28
baker
0.27
Bakery
0.26
past
0.26
Baker
0.24
don
0.23
Activations Density 0.102%