INDEX
    Explanations

    adverbs that convey surprise or emphasis

    Negative Logits
    ays
    -0.15
    克斯
    -0.14
    åŀ
    -0.14
    lst
    -0.14
     Leban
    -0.14
     Hairst
    -0.14
     Bernardino
    -0.13
     wsz
    -0.13
    徳
    -0.13
     Monter
    -0.13
    Positive Logits
    forge
    0.17
    πη
    0.16
    lijk
    0.16
    iae
    0.16
    GGLE
    0.15
    χεδόν
    0.15
    amble
    0.15
     Holl
    0.15
    omal
    0.15
    ably
    0.15
    Activation Density: 0.066%

    No Known Activations