INDEX
    Explanations

    instances of numerical values and dates in the text

    New Auto-Interp
    Negative Logits
    ıza
    -0.08
     poil
    -0.08
    iju
    -0.08
    Ư
    -0.08
    имо
    -0.07
    shaw
    -0.07
     LENG
    -0.07
    _________________↵↵
    -0.07
     baÅŁlan
    -0.07
    oux
    -0.07
    POSITIVE LOGITS
    TAG
    0.07
    463
    0.06
    825
    0.06
    DOI
    0.06
     tags
    0.05
     Fol
    0.05
     contested
    0.05
     fol
    0.05
    467
    0.05
    OO
    0.05
    Act Density 0.039%

    No Known Activations