INDEX
Explanations
references to leadership roles and organizational involvement
New Auto-Interp
Negative Logits
Âł
-0.37
2
-0.27
3
-0.26
Âł↵
-0.26
1
-0.26
0
-0.25
9
-0.25
–
-0.25
4
-0.24
15
-0.24
POSITIVE LOGITS
ü
0.19
________________________________________________________________
0.18
ó
0.18
________________________________
0.16
é
0.16
__________________________________
0.15
.','
0.15
&a
0.15
----------↵
0.15
________________
0.15
Activations Density 0.010%