    Explanations

    instances of being physically confined or restricted

    instances of the word "locked" and its variations, indicating confinement or restriction

    Negative Logits
        Token              Logit
        " exaggeration"    -0.72
        "ahon"             -0.70
        " Interpret"       -0.69
        "aste"             -0.68
        "Footnote"         -0.67
        "plot"             -0.66
        " Insp"            -0.65
        " Cosponsors"      -0.65
        "eness"            -0.65
        "brate"            -0.65

    Positive Logits
        Token              Logit
        " locked"           3.45
        " Locked"           2.22
        " lock"             2.06
        "locked"            2.01
        " locking"          2.01
        " unlocked"         1.97
        " locks"            1.91
        "Lock"              1.54
        " chained"          1.51
        " Lock"             1.46

    Activation Density: 0.018%

    No Known Activations