INDEX
    Explanations

    four divisions, Arabic, Wikipedia

    Negative Logits
    ρι  0.41
    dere  0.41
     হইতেছে  0.39
    0.39
    됐다  0.38
    ناك  0.37
     പ്രവര്‍ത്തന  0.37
     ду  0.36
    dados  0.36
    óricos  0.36
    Positive Logits
     ويكيپيديا  0.63
     تكتب  0.47
     تكون  0.46
     kuwa  0.41
     onu  0.41
    0.40
     Wikipédia  0.39
     Catawiki  0.39
     keeping  0.39
     Carrera  0.39
    Act Density 0.000%

    No Known Activations