    Explanations

    punctuation marks, particularly periods and commas, indicating sentence endings and pauses

    Negative Logits
    Token         Logit
    "aint"        -0.16
    "amburger"    -0.15
    "opa"         -0.15
    "ilian"       -0.14
    "ifen"        -0.14
    "enny"        -0.14
    "hots"        -0.14
    "ãĥĨãĥ«"      -0.14
    "idebar"      -0.13
    "achs"        -0.13
    Positive Logits
    Token         Logit
    " ç±"         0.16
    "ÏģÏĮ"        0.14
    "compose"     0.13
    "yl"          0.13
    " ç©"         0.13
    " Engel"      0.13
    "rots"        0.13
    "ypad"        0.13
    " longitud"   0.13
    "Ñıм"         0.13
    Activation Density: 0.099%

    No Known Activations