INDEX
Explanations
terms related to leadership and influential roles
NEGATIVE LOGITS
arella    -0.17
ation     -0.17
pector    -0.17
phe       -0.16
nem       -0.15
ee        -0.15
ten       -0.15
Ø®ÛĮ      -0.15
ting      -0.15
asion     -0.14
POSITIVE LOGITS
hip       0.20
less      0.20
ìĭŃ       0.18
edBy      0.18
role      0.17
-edge     0.17
ial       0.16
HIP       0.16
/command  0.16
role      0.16
Activations Density 0.029%