INDEX
Explanations
words related to the visual quality of food or presentation
New Auto-Interp
Negative Logits
ostel
-0.16
-UA
-0.15
erp
-0.15
udeau
-0.15
eut
-0.15
ingt
-0.15
pring
-0.15
agua
-0.14
êu
-0.14
ther
-0.14
POSITIVE LOGITS
atica
0.15
erman
0.15
arium
0.15
erna
0.14
ufen
0.14
capital
0.14
Invariant
0.14
Ack
0.14
iliar
0.13
ardo
0.13
Activations Density 0.002%