INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    ש
    1.45
    lerde
    1.08
     użyt
    1.07
    lerinden
    1.05
    ların
    1.04
    ̣t
    1.02
     vrch
    1.01
    ność
    1.00
     przyczyn
    1.00
    blätter
    1.00
    POSITIVE LOGITS
     TEAM
    1.41
    Team
    1.35
     Team
    1.34
    م
    1.34
     halinde
    1.32
    ה
    1.28
    TEAM
    1.16
    1.15
    ει
    1.13
    ים
    1.13
    Act Density 0.151%

    No Known Activations