INDEX
    Explanations

    phrases related to rules, authority, power, or control

    terms associated with authority and leadership transitions

    New Auto-Interp
    Negative Logits
    ertodd
    -0.71
    hammad
    -0.71
     contrace
    -0.61
    FFER
    -0.61
     Humanity
    -0.61
    Quotes
    -0.61
     defe
    -0.59
    Gre
    -0.59
    Kin
    -0.58
    endix
    -0.58
    POSITIVE LOGITS
    pin
    0.89
    s
    0.88
    ited
    0.84
    uin
    0.79
    nant
    0.78
    esses
    0.78
    pins
    0.78
    unders
    0.77
    ping
    0.77
    iever
    0.75
    Act Density 0.015%

    No Known Activations