INDEX
Explanations
references to food and beverage-related content
Negative Logits
foods       -0.30
Foods       -0.29
Food        -0.24
foods       -0.24
meals       -0.23
comida      -0.23
Food        -0.23
FOOD        -0.23
alimentos   -0.23
cuisine     -0.22
Positive Logits
drink        0.39
beverage     0.32
Drink        0.31
Beverage     0.31
drinks       0.31
beverages    0.30
Bever        0.30
drink        0.29
Drink        0.27
Drinks       0.26
Activations Density: 0.082%
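
The density figure is presumably the share of sampled tokens on which this feature has a nonzero activation; the page does not state the exact definition. A minimal Python sketch under that assumption, where `activation_density`, `activations`, and `threshold` are hypothetical names introduced only for illustration:

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires. The source page does not define the term.
    """
    if not activations:
        # No tokens sampled, so report zero rather than dividing by zero.
        return 0.0
    # Count tokens on which the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy input: the feature is active on 1 of 4 tokens.
density = activation_density([0.0, 0.0, 1.5, 0.0])
print(f"{density:.3%}")
```

Under this reading, a density of 0.082% would correspond to roughly 82 active tokens per 100,000 sampled.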