INDEX
Explanations
mentions of various forms of disabilities
references to disabilities and related terms
New Auto-Interp
Negative Logits
furt
-0.89
Collider
-0.76
kov
-0.73
orically
-0.72
ellipt
-0.70
anza
-0.68
tons
-0.66
ãĥĥãĥĪ
-0.66
andum
-0.65
Vald
-0.64
POSITIVE LOGITS
disabilities
0.96
disability
0.92
Disability
0.89
impair
0.85
Disabled
0.78
onge
0.75
diagnoses
0.74
abled
0.73
cripp
0.71
disabled
0.71
Activations Density 0.044%