INDEX
Explanations
phrases describing delicious food and dining experiences
New Auto-Interp
Negative Logits
fusca
-0.55
femei
-0.53
rente
-0.52
coser
-0.50
pegat
-0.50
preventivo
-0.49
linho
-0.48
akaian
-0.46
cromado
-0.46
leão
-0.46
POSITIVE LOGITS
flavors
1.07
tasted
1.02
delicious
1.02
flavours
1.01
tastes
1.01
taste
0.97
tasting
0.96
flavor
0.95
savory
0.94
flavorful
0.94
Activations Density 0.489%