INDEX
    Explanations

    interjections expressing surprise or exclamation

    expressions of surprise or realization

    New Auto-Interp
    Negative Logits
    IUM
    -0.83
     Construct
    -0.73
    */(
    -0.73
    eers
    -0.73
    -+-+
    -0.70
     awoken
    -0.69
     Fired
    -0.68
    Introduced
    -0.68
    Registered
    -0.67
    flies
    -0.65
    POSITIVE LOGITS
    umph
    0.97
    hhhh
    0.95
    oho
    0.94
    oh
    0.91
    hh
    0.89
    anian
    0.85
    hhh
    0.82
    warts
    0.82
    Oh
    0.80
    ohan
    0.79
    Act Density 0.008%

    No Known Activations