INDEX
    Explanations

    phrases related to possession or control

    terms associated with concepts of control and governance

    Negative Logits
    Token             Logit
    " redacted"       -0.62
    " candid"         -0.62
    "vasive"          -0.59
    "livious"         -0.58
    " admitting"      -0.57
    " Invalid"        -0.56
    " Rhod"           -0.54
    "ipolar"          -0.53
    "accompanied"     -0.53
    " Missing"        -0.52
    Positive Logits
    Token             Logit
    "asses"           0.84
    "vre"             0.84
    "aces"            0.78
    "adle"            0.74
    "insula"          0.72
    "irements"        0.72
    "igree"           0.71
    "rils"            0.70
    "ills"            0.69
    "elight"          0.68
    Activation Density: 0.266%

    No Known Activations