INDEX
    Explanations

    exclamations expressing surprise or strong emotion

    expressions of surprise or exclamation

    New Auto-Interp
    Negative Logits
    20439
    -0.72
    -+-+
    -0.66
    ":[{"
    -0.66
    resso
    -0.65
     CTR
    -0.62
    iband
    -0.61
    Introduced
    -0.61
    */(
    -0.60
    perature
    -0.59
    ngth
    -0.59
    POSITIVE LOGITS
     yeah
    1.23
    hhhh
    1.18
    hhh
    1.12
    hh
    1.12
     yea
    1.05
    yeah
    1.03
     dear
    1.01
     hey
    0.99
     yes
    0.93
     wow
    0.93
    Act Density 0.022%

    No Known Activations