    Explanations

    phrases related to personal responsibility and accountability

    Negative Logits
    Token      Logit
    "argas"    -0.15
    " im"      -0.14
    "(s"       -0.14
    "aight"    -0.14
    "--"       -0.14
    "onga"     -0.14
    " fil"     -0.14
    " Lone"    -0.13
    " civil"   -0.13
    " Ext"     -0.13
    Positive Logits
    Token                       Logit
    "â̦↵↵↵"                      0.21
    "oldur"                     0.16
    "edback"                    0.16
    "ConfigurationException"    0.15
    "$MESS"                     0.15
    "ÄįnÃŃk"                    0.14
    "adnÃŃ"                     0.14
    "yonel"                     0.14
    "bstract"                   0.14
    "inflate"                   0.14
    Activation Density 0.928%

    No Known Activations