INDEX
Explanations
statistical comparisons indicating likelihood or probability
comparative phrases related to demographic statistics and behaviors
New Auto-Interp
Negative Logits
enment
-0.70
ablishment
-0.68
Refresh
-0.67
ionics
-0.67
emonium
-0.65
acly
-0.65
imentary
-0.64
anian
-0.64
Wonderful
-0.63
seq
-0.62
POSITIVE LOGITS
disproportionately
1.25
happier
1.19
happiest
1.11
disproportion
1.09
wealthier
1.02
healthier
1.01
significantly
0.99
discriminated
0.98
more
0.96
twice
0.96
Activations Density 0.143%