INDEX
    Explanations

    expressions of surprise or excitement

    Interjections or affirmations

    expressive sounds and interjections

    New Auto-Interp
    Negative Logits
     Мексичка
    -1.14
     itſelf
    -1.05
     Efq
    -1.04
    )");
    
    -1.02
    )";
    
    -1.02
     iſt
    -1.02
     ―――――
    -1.01
    $")
    -1.00
    neſs
    -1.00
     فريبيس
    -0.97
    POSITIVE LOGITS
    !
    0.79
     I
    0.77
    <eos>
    0.68
     you
    0.63
    0.63
     freakin
    0.62
     freaking
    0.62
    !!!
    0.60
     Oh
    0.58
     yeah
    0.57
    Act Density 0.176%

    No Known Activations