INDEX
Explanations
food-related items and descriptions
Negative Logits
auer        -0.17
habit       -0.15
éĸ¢éĢ£      -0.14
frica       -0.14
.Toolkit    -0.14
ernel       -0.14
plier       -0.13
ΣεÏĢ        -0.13
CCI         -0.13
Å¥          -0.13
Positive Logits
choice      0.20
choice      0.17
served      0.17
Temp        0.16
our         0.16
(choice     0.15
Choice      0.15
-choice     0.15
-house      0.15
special     0.15
Activations Density: 0.028%