INDEX
    Explanations

    expressions of surprise or exclamation

    New Auto-Interp
    Negative Logits
     myſelf
    -1.07
     Efq
    -0.95
     themſelves
    -0.88
     houſe
    -0.88
     itſelf
    -0.86
     himſelf
    -0.85
     Majefty
    -0.83
     cdti
    -0.82
     useRouter
    -0.80
     againſt
    -0.77
    POSITIVE LOGITS
     Oh
    1.06
    Oh
    1.02
     oh
    0.91
    oh
    0.87
     sweet
    0.78
     OH
    0.76
    sweet
    0.68
     мәкал
    0.68
    OH
    0.67
    Ohh
    0.66
    Act Density 0.037%

    No Known Activations