INDEX
    Explanations

    indicators of authorship and citation in written content

    New Auto-Interp
    Negative Logits
    plier
    -0.16
     Fibonacci
    -0.14
    elper
    -0.14
    lessly
    -0.13
    SPATH
    -0.13
     Hyp
    -0.13
    สะ
    -0.13
    elts
    -0.13
    elong
    -0.13
    preter
    -0.13
    POSITIVE LOGITS
     Uncategorized
    0.17
     DISCLAIM
    0.16
     jadx
    0.14
    utenberg
    0.14
    ılı
    0.13
    (Arg
    0.13
    ë¡Ģ
    0.13
    abcdefghijkl
    0.13
    ôn
    0.13
     strtoupper
    0.13
    Act Density 0.042%

    No Known Activations