INDEX
    NEGATIVE LOGITS
    " ﺍﻟ"                  1.40
    " Ansible"             1.38
    " Kähler"              1.29
    [token not rendered]   1.29
    "MONGO"                1.28
    [token not rendered]   1.28
    " waiters"             1.23
    [token not rendered]   1.21
    " thefe"               1.21
    " mores"               1.20
    POSITIVE LOGITS
    "را"                   1.58
    "ية"                   1.30
    "لا"                   1.28
    "و"                    1.25
    [token not rendered]   1.23
    "َ"                    1.23
    "اد"                   1.21
    "ні"                   1.20
    "પણે"                  1.09
    " américain"           1.06
    Act Density 0.076%

    No Known Activations