INDEX
Explanations
references to different abilities or capabilities
phrases related to capabilities or competencies
New Auto-Interp
Negative Logits
Surv
-0.67
gar
-0.66
adish
-0.63
anski
-0.63
roman
-0.62
Observ
-0.62
Wrap
-0.61
gone
-0.61
Bride
-0.61
Coconut
-0.61
POSITIVE LOGITS
destro
0.90
auga
0.87
ibilities
0.86
ibility
0.86
Ability
0.85
impaired
0.83
ability
0.82
Reviewer
0.77
incap
0.75
bodied
0.74
Activations Density 0.030%