INDEX
Explanations
words that evaluate health impacts as positive or negative
New Auto-Interp
Negative Logits
الحره
-0.64
featureID
-0.60
JspWriter
-0.59
anglès
-0.59
daging
-0.59
atigable
-0.58
Datuak
-0.58
Hackett
-0.57
setVerticalGroup
-0.57
StoryboardSegue
-0.54
POSITIVE LOGITS
beneficial
1.18
detrimental
0.93
Beneficial
0.93
harmful
0.87
benefit
0.81
healthy
0.76
benef
0.76
damaging
0.70
boon
0.69
benefiting
0.68
Activations Density 0.187%