INDEX
Explanations
instances of governance and social issues, particularly around management and hierarchy
New Auto-Interp
Negative Logits
åIJĹ
-0.18
åĹİ
-0.17
ber
-0.15
aban
-0.15
lic
-0.15
lj
-0.14
Nico
-0.14
od
-0.14
screen
-0.14
addCriterion
-0.14
POSITIVE LOGITS
ÙĪÙħا
0.21
/how
0.16
besides
0.15
اÙħÙĩ
0.15
ysa
0.15
vur
0.15
ãĥ¼ãĥ«
0.15
ï¼Į以åıĬ
0.15
qi
0.14
oji
0.14
Activations Density 0.253%