INDEX
Explanations
references to meals and food-related activities
New Auto-Interp
Negative Logits
Muk
-0.16
oda
-0.15
ickle
-0.14
Barney
-0.14
.bin
-0.14
ç§
-0.14
bin
-0.14
g
-0.14
[
-0.13
Material
-0.13
POSITIVE LOGITS
meals
0.53
meal
0.46
Meals
0.37
Meal
0.36
meal
0.36
Meal
0.35
飯
0.27
food
0.27
-me
0.26
nutritious
0.23
Activations Density 0.100%