INDEX
Explanations
phrases that refer to specific groups of people or medical conditions
references to individuals or groups with various attributes or conditions
Negative Logits
orpor      -0.66
promul     -0.64
pher       -0.61
Authors    -0.60
among      -0.59
wark       -0.59
ecause     -0.58
finals     -0.58
oner       -0.57
once       -0.57
Positive Logits
disabilities    1.19
propensity      1.01
penchant        0.94
disability      0.94
vested          0.92
backgrounds     0.91
allergies       0.90
predis          0.90
knack           0.88
incomes         0.88
Activations Density: 0.219%