INDEX
    Explanations

    references to academic or technical papers

    various forms of the word "paper" and other related terms

    New Auto-Interp
    Negative Logits
    STD
    -0.75
    Qual
    -0.72
     exha
    -0.71
    Brend
    -0.71
     referen
    -0.67
    shall
    -0.66
     unden
    -0.64
    ¿½
    -0.63
    LESS
    -0.63
    VID
    -0.62
    POSITIVE LOGITS
    ®
    0.71
     ('
    0.70
    ives
    0.70
    utics
    0.68
     extraord
    0.68
    ilon
    0.64
    igans
    0.64
    dq
    0.63
    ffiti
    0.63
     ("
    0.63
    Act Density 0.338%

    No Known Activations