INDEX
    Explanations

    words related to actions and states of being

    New Auto-Interp
    Negative Logits
    velope
    -0.17
    plug
    -0.15
    <?,
    -0.15
    IRECTION
    -0.14
    747
    -0.14
     nurs
    -0.14
    lyn
    -0.14
    å½
    -0.14
     reminis
    -0.13
    akes
    -0.13
    POSITIVE LOGITS
    ogan
    0.15
    edn
    0.14
    aeda
    0.14
    ypi
    0.14
    оÑĢÑĭ
    0.14
    EIF
    0.14
    rary
    0.14
    unte
    0.13
     Fol
    0.13
    .tc
    0.13
    Act Density 0.064%

    No Known Activations