INDEX
Explanations
adjectives related to physical ability or disability
terms related to ability and disability
Negative Logits
Canaver    -0.73
ocene      -0.63
uble       -0.61
Guinness   -0.60
Corp       -0.60
ppard      -0.59
earable    -0.59
arty       -0.59
urized     -0.58
ourke      -0.58
Positive Logits
ness          0.83
guiActiveUn   0.75
NESS          0.72
trades        0.68
nesses        0.65
citiz         0.64
iaries        0.64
iary          0.59
cred          0.59
themselves    0.58
Activations Density 0.259%
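
The page does not define the density figure. A common reading is the share of sampled token positions at which the feature has a nonzero activation. The sketch below assumes that reading; the helper name `activation_density`, the `threshold` parameter, and the sample values are all hypothetical and are not taken from the page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature
    fires. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 3 of 8 token positions have a nonzero activation.
sample = [0.0, 0.0, 1.2, 0.0, 0.0, 0.4, 0.0, 2.1]
density = activation_density(sample)
print(f"Activations Density {density:.3%}")  # prints "Activations Density 37.500%"
```

Under this reading, a density of 0.259% would mean the feature fires on roughly 1 in 386 sampled token positions.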