Explanations
terms and phrases associated with eating and food consumption
Negative Logits
Token       Logit
']))        -0.87
}}],        -0.86
}));        -0.82
ρίου        -0.77
Prou        -0.75
])));       -0.74
));         -0.73
necesar     -0.72
enderror    -0.72
Guil        -0.71
Positive Logits
Token       Logit
eat         1.58
eaten       1.50
EAT         1.49
Eat         1.38
eats        1.35
eating      1.31
Eat         1.29
ate         1.24
Eating      1.22
Eating      1.20
Activations Density: 0.063%