INDEX
Explanations
terms related to food and health services
Negative Logits
ym        -0.21
yt        -0.18
addle     -0.17
inen      -0.16
upy       -0.16
alse      -0.15
Bill      -0.15
bill      -0.15
gr        -0.15
EB        -0.15
Positive Logits
.asc      0.16
icker     0.15
omer      0.15
/Dk       0.14
_SO       0.14
.SO       0.14
çĸĨ       0.14
praak     0.14
arkan     0.14
(EFFECT   0.14
Activations Density: 0.040%