INDEX
    Explanations

    references to physical sensations and emotional expressions

    New Auto-Interp
    Negative Logits
    bjerg
    -0.16
    RITE
    -0.15
    ovit
    -0.15
    adero
    -0.15
    apiro
    -0.15
    oins
    -0.14
    ugins
    -0.14
     Yön
    -0.14
    ritel
    -0.14
    idia
    -0.14
    POSITIVE LOGITS
     interrupt
    0.16
    /archive
    0.15
    analysis
    0.15
    amp
    0.14
    cho
    0.14
    653
    0.14
    thing
    0.14
     lạc
    0.14
    ï¼
    0.13
    kee
    0.13
    Act Density 0.161%

    No Known Activations