INDEX
    Explanations

    phrases related to being confined or restricted

    New Auto-Interp
    Negative Logits
    lobber
    -0.15
    fic
    -0.14
    732
    -0.14
     Resist
    -0.14
    stalk
    -0.14
    urger
    -0.14
    |int
    -0.14
     Careers
    -0.14
    wick
    -0.13
    SI
    -0.13
    POSITIVE LOGITS
    Äįan
    0.16
    Cube
    0.15
    oda
    0.15
    reta
    0.14
    onto
    0.14
    athe
    0.14
    nds
    0.14
    êµIJ
    0.14
     Rash
    0.14
    usher
    0.13
    Act Density 0.039%

    No Known Activations