INDEX
    Explanations

    formatting and structural elements commonly found in academic writing

    New Auto-Interp
    Negative Logits
    iol
    -0.07
    144
    -0.06
    oda
    -0.06
    .gwt
    -0.06
    agne
    -0.06
     voks
    -0.06
    cout
    -0.06
    quir
    -0.06
     Ø¢Ùħ
    -0.06
    ahat
    -0.06
    POSITIVE LOGITS
     ëħ¼
    0.07
     REFERENCES
    0.07
     Foot
    0.07
    bine
    0.07
     foot
    0.06
     JK
    0.06
     TMPro
    0.06
     laid
    0.06
    adero
    0.06
    ÄįnÃŃk
    0.06
    Act Density 0.052%

    No Known Activations