INDEX
    Explanations

    instances of governance and social issues, particularly around management and hierarchy

    Negative Logits
    åIJĹ            -0.18
    åĹİ             -0.17
    ber             -0.15
    aban            -0.15
    lic             -0.15
    lj              -0.14
     Nico           -0.14
    od              -0.14
     screen         -0.14
     addCriterion   -0.14
    Positive Logits
     ÙĪÙħا         0.21
    /how            0.16
     besides        0.15
    اÙħÙĩ          0.15
    ysa             0.15
     vur            0.15
    ãĥ¼ãĥ«          0.15
    ï¼Į以åıĬ        0.15
    qi              0.14
    oji             0.14
    Activation Density: 0.253%

    No Known Activations