INDEX
Explanations
expressions related to food enjoyment and culinary experiences
New Auto-Interp
Negative Logits
odyn
-0.15
ourn
-0.15
UNT
-0.14
çon
-0.14
ór
-0.14
akeup
-0.14
меÑĤÑĮ
-0.14
eger
-0.13
exas
-0.13
ogo
-0.13
POSITIVE LOGITS
addictive
0.22
dro
0.19
swo
0.18
_hook
0.17
speech
0.17
converted
0.16
blown
0.16
addicted
0.16
gas
0.16
GAS
0.16
Activations Density 0.169%