    Explanations

    occurrences of authorship and editorial roles in writing

    Negative Logits
    ora            -0.16
    ominated       -0.15
    Steam          -0.15
    orra           -0.14
    pac            -0.14
     Ziel          -0.14
     Ware          -0.14
    ona            -0.14
    ape            -0.14
    ades           -0.13
    Positive Logits
    เก             0.14
    竹             0.14
    θμ             0.14
    phins          0.14
    imore          0.14
    /gif           0.14
    .TestTools     0.14
    .gnu           0.14
    Resistance     0.14
    ulton          0.14
    Act Density 0.022%

    No Known Activations