INDEX
Explanations
titles or roles related to leadership positions such as "chief."
titles or roles associated with leadership or senior positions
New Auto-Interp
Negative Logits
unp
-0.67
reciproc
-0.66
zero
-0.66
res
-0.66
valid
-0.64
que
-0.63
prep
-0.62
values
-0.61
multiple
-0.61
physically
-0.60
POSITIVE LOGITS
chief
4.66
Chief
2.56
chief
1.50
Chief
1.46
fortune
1.35
chiefs
1.15
Chiefs
1.12
leader
1.00
ataka
0.99
former
0.94
Activations Density 0.016%