INDEX
Explanations
adjectives related to health and wellness
references to health conditions and states of illness
Negative Logits
Flavoring   -0.73
breed       -0.61
MER         -0.61
ggles       -0.61
delim       -0.60
Us          -0.59
Rule        -0.58
poll        -0.58
guid        -0.57
href        -0.57
Positive Logits
ridden      0.85
retty       0.82
usional     0.81
cember      0.76
angering    0.71
figured     0.70
blind       0.66
psychiat    0.66
enough      0.65
jured       0.65
Activations Density 0.263%