INDEX
    Explanations

    phrases related to personal responsibility and decision-making

    New Auto-Interp
    Negative Logits
    .mysql
    -0.16
    own
    -0.15
    achable
    -0.15
    mada
    -0.15
    ipur
    -0.15
    ään
    -0.15
    iber
    -0.14
    iyim
    -0.14
    bron
    -0.14
    estic
    -0.14
    POSITIVE LOGITS
     behalf
    0.37
     bagi
    0.20
    帮
    0.20
     há»Ļ
    0.19
    æĽ¿
    0.16
     automatically
    0.16
     dla
    0.16
    ë²½
    0.16
    essler
    0.16
     Autom
    0.15
    Act Density 0.212%

    No Known Activations