INDEX
    Explanations

    citations and references in academic documents

    New Auto-Interp
    Negative Logits
     Dig
    -0.16
    399
    -0.15
     Dum
    -0.14
    digest
    -0.14
     dig
    -0.14
    dig
    -0.14
     tenants
    -0.14
    arms
    -0.14
    ypo
    -0.13
    asi
    -0.13
    POSITIVE LOGITS
     Rudd
    0.17
    odus
    0.17
    ZO
    0.16
    ugas
    0.15
    orado
    0.15
     Eld
    0.15
     unders
    0.15
    iens
    0.15
    *)(
    0.14
    arkan
    0.14
    Act Density 0.003%

    No Known Activations