INDEX
    Explanations
    NEGATIVE LOGITS
    Token    Logit
    "я"      2.09
    "it"     1.79
    "ت"      1.47
    "м"      1.46
    "ле"     1.42
    "ת"      1.41
    "ри"     1.36
    "س"      1.28
    "f"      1.27
    "את"     1.25
    POSITIVE LOGITS
    Token         Logit
    "%,"          1.16
    "pple"        1.13
    " Nên"        1.11
    "MG"          1.09
    "ppled"       1.09
    " ntawm"      1.08
    "X"           1.08
    " بلکه"       1.06
    "siniz"       1.05
    "Cadastro"    1.05
    Activation Density: 0.167%

    No Known Activations