INDEX
Explanations
phrases related to health conditions and medical attributes
New Auto-Interp
Negative Logits
wark
-0.77
rail
-0.70
rand
-0.70
hack
-0.68
oult
-0.67
cade
-0.67
doing
-0.66
ernaut
-0.65
March
-0.65
alion
-0.63
POSITIVE LOGITS
disabilities
1.71
histories
1.09
differing
1.02
weakened
1.02
severe
1.00
autism
0.99
incomes
0.97
impair
0.96
compromised
0.94
impaired
0.93
Activations Density 0.107%