Explanations
references to leadership positions or roles associated with being in charge or commanding authority
Negative Logits
fy          -0.18
ment        -0.15
atar        -0.15
伯          -0.15
edio        -0.15
ediator     -0.14
loyd        -0.14
eum         -0.14
place       -0.14
-minded     -0.14
Positive Logits
quarters    0.23
lined       0.20
gear        0.18
quartered   0.17
rick        0.17
quarter     0.17
stock       0.17
alam        0.17
swith       0.17
locked      0.16
Activations Density 0.053%