Explanations
words related to competence and incompetence
terms associated with competence and incompetence in various contexts
Negative Logits
forest   -0.77
den      -0.75
eed      -0.74
hop      -0.73
Pause    -0.72
eda      -0.70
patch    -0.70
eper     -0.70
Hop      -0.70
fter     -0.69
Positive Logits
incompetent    1.10
glers          1.06
competent      1.04
incompetence   0.93
umbn           0.89
incompet       0.87
competence     0.86
abama          0.84
inois          0.81
ocrats         0.76
Activations Density: 0.029%