INDEX
    Explanations
    NEGATIVE LOGITS
    ~,          -0.06
    722         -0.06
     таблет     -0.06
    ^n          -0.06
     Lei        -0.06
     ermög      -0.06
    ых          -0.06
    .ot         -0.06
     lc         -0.06
    ument       -0.06
    POSITIVE LOGITS
     Franklin   0.37
    lin         0.12
    LIN         0.10
     Randolph   0.09
    _BUCKET     0.07
     pitching   0.07
     وقد        0.07
    ña          0.07
     violates   0.06
     федераль   0.06
    Activation Density 0.002%

    No Known Activations