INDEX
    Explanations

    references to citations and footnotes in a document

    New Auto-Interp
    Negative Logits
    åħ·
    -0.16
    igne
    -0.16
    rum
    -0.15
    flo
    -0.15
    opp
    -0.15
    ith
    -0.14
    CreatedBy
    -0.14
    apy
    -0.14
    una
    -0.14
    GRES
    -0.14
    POSITIVE LOGITS
    862
    0.17
    .nih
    0.16
    zeros
    0.14
     Sentinel
    0.14
    lingen
    0.13
     SENT
    0.13
     Scr
    0.13
    piler
    0.13
     recip
    0.13
    /lic
    0.13
    Act Density 0.026%

    No Known Activations