INDEX
Explanations
mentions of disabilities and related terms
references to disabilities
New Auto-Interp
Negative Logits
furt
-0.77
orically
-0.77
tons
-0.74
kov
-0.73
ãĥĥãĥĪ
-0.73
unin
-0.71
ellipt
-0.70
ebus
-0.70
achus
-0.68
anza
-0.68
POSITIVE LOGITS
disabilities
0.98
disability
0.98
Disability
0.96
Disabled
0.81
impair
0.79
Rights
0.77
rights
0.76
diagnoses
0.74
Discrimination
0.73
afe
0.71
Activations Density 0.034%