INDEX
    Explanations

    references to political accountability and investigation

    Negative Logits
     meines
    -0.66
     виправивши
    -0.66
     minhas
    -0.66
     my
    -0.62
     meus
    -0.62
     meiner
    -0.62
     nossos
    -0.61
     our
    -0.60
     mijn
    -0.59
     meu
    -0.58
    Positive Logits
     “
    1.79
     "
    1.78
     “…
    1.68
     "...
    1.61
     “[
    1.57
     “...
    1.55
     "[
    1.50
     «
    1.41
    ,「
    1.40
    1.31
    Act Density 1.713%

    No Known Activations