INDEX
Explanations
names of managerial positions in various organizations
mentions of managerial roles or titles
New Auto-Interp
Negative Logits
çīĪ
-0.84
ENC
-0.80
FER
-0.78
cale
-0.65
milo
-0.65
ilings
-0.63
lihood
-0.62
prelim
-0.62
bent
-0.62
encing
-0.62
POSITIVE LOGITS
iewicz
0.80
ial
0.79
stadt
0.78
ials
0.75
anova
0.71
oeuv
0.70
eers
0.70
owitz
0.69
ihilation
0.69
iola
0.69
Activations Density 0.024%