INDEX
    Explanations

    words related to holding beliefs or positions

    expressions of possessing or maintaining views and positions of power

    New Auto-Interp
    Negative Logits
    ————
    -0.78
    issance
    -0.71
    lease
    -0.71
    lez
    -0.68
    ese
    -0.67
    ombies
    -0.66
    ghan
    -0.66
    endix
    -0.65
    FTWARE
    -0.64
    ibel
    -0.63
    POSITIVE LOGITS
     sway
    1.27
     onto
    1.11
     accountable
    0.99
     steady
    0.92
     hostage
    0.90
     captive
    0.87
     dear
    0.83
    overs
    0.81
    hold
    0.81
     secrets
    0.81
    Act Density 0.040%

    No Known Activations