INDEX
    Explanations

    expressions of gratitude and acknowledgment

    New Auto-Interp
    Negative Logits
    OLID
    -0.15
    rg
    -0.14
     peril
    -0.13
     solid
    -0.13
    bject
    -0.13
    ohn
    -0.13
    facebook
    -0.13
    hta
    -0.13
     inflated
    -0.13
     deÄŁ
    -0.13
    POSITIVE LOGITS
    istrovstvÃŃ
    0.17
    iyim
    0.15
    ovacÃŃ
    0.15
    olland
    0.14
     Www
    0.14
    lien
    0.14
     meis
    0.14
    ozem
    0.14
     myself
    0.14
    -mf
    0.14
    Act Density 0.039%

    No Known Activations