INDEX
    Explanations

    words related to political power

    terms related to power dynamics and distribution

    New Auto-Interp
    Negative Logits
    riad
    -0.71
     Taste
    -0.71
     Von
    -0.70
    verett
    -0.69
    romeda
    -0.69
    eret
    -0.67
    ead
    -0.67
    TAG
    -0.67
    ogene
    -0.67
    ALK
    -0.66
    POSITIVE LOGITS
     levers
    1.00
    houses
    0.98
     vested
    0.95
     wielded
    0.87
    lessness
    0.85
    FUL
    0.81
    stroke
    0.80
    lifting
    0.80
     outage
    0.79
     vacuum
    0.77
    Act Density 0.037%

    No Known Activations