INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Tested
    -0.10
    bereiche
    -0.09
     рез
    -0.09
     lalu
    -0.09
    (Request
    -0.09
     жүр
    -0.09
     dificuldade
    -0.09
     numb
    -0.09
     رك
    -0.08
     dificuldades
    -0.08
    POSITIVE LOGITS
    dens
    0.08
    ана
    0.07
    on
    0.07
    Coach
    0.07
     kal
    0.07
    coach
    0.07
    magn
    0.07
     Coach
    0.07
     pronunciation
    0.07
    ане
    0.07
    Act Density 0.001%

    No Known Activations