INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Şubat
    1.14
     ľ
    1.12
     incline
    1.10
     Agustus
    1.10
     Sosial
    1.06
     imposs
    1.05
     egli
    1.04
     thine
    1.03
    1.03
     Jn
    1.01
    POSITIVE LOGITS
    ef
    1.29
    est
    1.03
    д
    1.02
    ek
    0.98
    0.97
    0.93
    एम
    0.92
    ens
    0.91
    কে
    0.91
    anding
    0.91
    Act Density 0.134%

    No Known Activations