INDEX
Explanations
phrases related to negative emotional states
statements about occurrences or conditions
Negative Logits
poke        -0.72
races       -0.66
resp        -0.65
Write       -0.65
wrote       -0.62
WRITE       -0.62
learns      -0.62
Supports    -0.62
itely       -0.61
Classes     -0.60
Positive Logits
exacerbated    1.20
contrasted     1.08
attributable   1.04
outwe          1.04
evident        1.04
sympt          1.02
contagious     1.02
overshadow     1.01
reflected      1.00
compounded     1.00
Activations Density: 0.377%