INDEX
Explanations
references to healthy food options and their benefits
Negative Logits
dech     -0.15
uela     -0.15
лаÑĪ     -0.15
lfw      -0.15
ideo     -0.14
WSC      -0.14
bourg    -0.14
krom     -0.14
aji      -0.14
nels     -0.14
Positive Logits
yogurt   0.42
Greek    0.39
yog      0.38
Yog      0.36
Greek    0.35
cottage  0.26
Greece   0.24
plain    0.24
Greeks   0.24
milk     0.23
Activations Density: 0.036%
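
To put the density figure in concrete terms, here is a minimal sketch. It assumes that density means the fraction of tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" counts a token as active when its activation
    is strictly greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 tokens, 36 of which activate the feature.
acts = [0.0] * 99_964 + [1.5] * 36
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.036%
```

Under that assumption, a density of 0.036% corresponds to roughly 36 active tokens per 100,000, which fits a feature that fires only on a narrow topic.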