INDEX
    Explanations

    terms and phrases related to human rights issues

    New Auto-Interp
    Negative Logits
    RetentionPolicy
    -0.83
     ویکی‌پدیا
    -0.74
     reconna
    -0.71
    accepte
    -0.69
    Personendaten
    -0.69
     suffit
    -0.68
    multirow
    -0.68
     initComponents
    -0.68
    IsContent
    -0.68
    __':
    -0.67
    POSITIVE LOGITS
     déclaration
    0.57
     lung
    0.56
     fucking
    0.56
     blame
    0.56
     diritti
    0.55
     rights
    0.55
     declare
    0.53
     declaration
    0.53
     mierda
    0.53
    cols
    0.52
    Act Density 0.103%

    No Known Activations