INDEX
    Explanations

    terms related to causing instability or disruption

    terms related to destabilization and its effects

    New Auto-Interp
    Negative Logits
    Rate
    -0.73
     Participant
    -0.67
    elf
    -0.67
    friend
    -0.66
    Tree
    -0.66
    une
    -0.65
    puted
    -0.65
     Origin
    -0.64
    endi
    -0.64
    Personal
    -0.64
    POSITIVE LOGITS
     destabil
    1.25
    icter
    0.92
    espie
    0.89
    izational
    0.83
     disarm
    0.82
     instability
    0.80
    ieties
    0.78
     Ukrain
    0.78
     Rohing
    0.78
     psychiat
    0.77
    Act Density 0.010%

    No Known Activations