INDEX
    Explanations

    phrases concerning accountability and legal responsibility

    New Auto-Interp
    Negative Logits
    erty
    -0.17
    ltra
    -0.16
    akt
    -0.15
    imore
    -0.14
    óm
    -0.14
    gratis
    -0.14
    /Runtime
    -0.14
    μη
    -0.14
    amanho
    -0.14
    esian
    -0.14
    POSITIVE LOGITS
     responsible
    0.63
    res
    0.58
    -res
    0.51
     responsable
    0.48
     respons
    0.48
     Responsible
    0.48
     RESPONS
    0.47
    ponsible
    0.47
     responsibility
    0.46
     accountable
    0.45
    Act Density 0.160%

    No Known Activations