    Explanations

    numeric identifiers or data points

    Negative Logits
    Token          Logit
    "berman"       -0.15
    "usive"        -0.15
    "åŁ"           -0.14
    "yna"          -0.14
    "سر"           -0.13
    ":"            -0.13
    " beg"         -0.13
    "azzo"         -0.13
    "udes"         -0.13
    "rie"          -0.13
    Positive Logits
    Token          Logit
    "ir"           0.20
    "undler"       0.17
    " Obr"         0.15
    "Binder"       0.14
    "iban"         0.14
    "ilir"         0.14
    "abus"         0.14
    "ag"           0.14
    "lineno"       0.14
    " PropTypes"   0.14
    Activation Density: 0.382%

    No Known Activations