    Explanations

    patterns related to citations and article formatting in scholarly references

    Negative Logits
    Token       Logit
    ore         -0.16
    elves       -0.16
    ement       -0.15
    osh         -0.14
    ulle        -0.14
    ince        -0.14
    tes         -0.14
    erg         -0.14
    edic        -0.14
    никÑĸв      -0.13
    Positive Logits
    Token       Logit
    íĥĦ         0.14
    isbury      0.14
    GuidId      0.14
    ìļķ         0.14
    enario      0.13
    HeaderCode  0.13
    à¸Ńà¸Ķ      0.13
    íĥķ         0.13
    irket       0.13
    /fw         0.13
    Activation Density: 0.006%

    No Known Activations