INDEX
Explanations
terms related to management and leadership roles
New Auto-Interp
Negative Logits
ãģ°
-0.16
enou
-0.15
bum
-0.15
olle
-0.15
ucher
-0.14
oro
-0.14
Ñĥгод
-0.14
важ
-0.13
_drv
-0.13
fall
-0.13
POSITIVE LOGITS
member
0.15
ëĭĺ
0.15
imity
0.14
APH
0.14
jin
0.14
TRACE
0.14
em
0.14
Member
0.14
Collapsed
0.14
ca
0.14
Activations Density 0.008%