INDEX
Explanations
words related to dietary habits
references to specific dietary guidelines and food groups
New Auto-Interp
Negative Logits
exha
-0.75
Witcher
-0.72
livest
-0.72
gobl
-0.69
streng
-0.69
unden
-0.69
ãĥ¼ãĥĨ
-0.68
âĸ¬
-0.68
surv
-0.68
shield
-0.67
POSITIVE LOGITS
wana
1.17
pace
1.03
heet
1.03
cape
1.02
peak
1.01
onic
0.98
ail
0.93
ilon
0.93
ets
0.93
tered
0.90
Activations Density 0.017%