INDEX
    Explanations

    numerical references and citations in academic texts

    New Auto-Interp
    Negative Logits
    arkan
    -0.15
    ød
    -0.15
    umu
    -0.15
    strup
    -0.15
    або
    -0.15
     Virt
    -0.15
    patch
    -0.14
    DST
    -0.14
    ابÛĮ
    -0.14
    ее
    -0.14
    POSITIVE LOGITS
     Mick
    0.15
    fuse
    0.15
    ahun
    0.14
     Vacuum
    0.14
    ago
    0.14
    obl
    0.14
     Thatcher
    0.14
     daily
    0.14
     vacuum
    0.13
    iden
    0.13
    Act Density 0.030%

    No Known Activations