INDEX
    Explanations

    adjectives and adverbs that convey intensity or emphasis

    Negative Logits
    Token         Logit
    "olon"        -0.16
    "abbo"        -0.16
    "ลà¸Ńà¸ĩ"     -0.15
    "isten"       -0.14
    "inati"       -0.14
    "atch"        -0.14
    "uto"         -0.14
    "eva"         -0.14
    "ubu"         -0.14
    " hit"        -0.13
    Positive Logits
    Token         Logit
    "ãģıãģł"      0.16
    "بÙĬع"        0.15
    "apesh"       0.15
    "onical"      0.15
    "eyen"        0.15
    "erness"      0.14
    "665"         0.14
    "ÑĢип"        0.14
    "èĤ"          0.14
    "paramref"    0.13
    Activation Density: 0.015%

    No Known Activations