INDEX
    Explanations

    phrases related to threats or potential negative impacts

    references to threats or damage to rights and institutions

    New Auto-Interp
    Negative Logits
    ãĤ´ãĥ³
    -0.79
    omer
    -0.73
    hots
    -0.71
    owler
    -0.71
    oaded
    -0.70
    placed
    -0.69
    --+
    -0.69
    arate
    -0.68
    atoon
    -0.68
    tackle
    -0.68
    POSITIVE LOGITS
     integrity
    1.53
     livelihood
    1.48
     viability
    1.47
     credibility
    1.31
     wellbeing
    1.25
     lives
    1.25
     stability
    1.24
     validity
    1.23
     reliability
    1.20
     effectiveness
    1.19
    Act Density 0.211%

    No Known Activations