INDEX
    Explanations

    exclamatory interjections or expressions of surprise

    expressions of surprise or exclamation

    Negative Logits
    Token            Logit
    "Pub"            -0.70
    "Prior"          -0.68
    "Prem"           -0.66
    "emi"            -0.64
    "ESE"            -0.64
    "ences"          -0.61
    "Abstract"       -0.61
    "és"             -0.59
    "Dem"            -0.59
    " Associates"    -0.58
    Positive Logits
    Token            Logit
    " oh"            3.54
    " Oh"            1.62
    "Oh"             1.49
    "ohm"            1.41
    " wow"           1.39
    " ah"            1.38
    "oh"             1.37
    " hey"           1.36
    " uh"            1.33
    " eh"            1.32
    Activation Density: 0.012%

    No Known Activations