INDEX
    Explanations

    punctuation marks, particularly periods and commas

    New Auto-Interp
    Negative Logits
    attempt
    -0.07
    gings
    -0.07
    βολ
    -0.07
    VERTISEMENT
    -0.07
    Attempt
    -0.06
    lendirme
    -0.06
    ÏĨεÏģ
    -0.06
    AGR
    -0.06
    uai
    -0.06
    à¹īาà¸ĩ
    -0.06
    POSITIVE LOGITS
     themselves
    0.07
     sayesinde
    0.07
     otherwise
    0.07
    -sama
    0.07
     even
    0.07
     resulting
    0.06
    avid
    0.06
    rof
    0.06
     pit
    0.06
     cul
    0.06
    Act Density 0.048%

    No Known Activations