INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ك
    0.86
    %>%
    0.83
     في
    0.71
     وي
    0.69
    0.67
    ير
    0.66
    ́t
    0.66
    ́u
    0.64
    ת
    0.63
     ويع
    0.63
    POSITIVE LOGITS
     tasa
    0.59
    h
    0.59
     galactose
    0.57
     pross
    0.56
     بیٹے
    0.56
    0.54
     rate
    0.53
    nios
    0.53
    N
    0.53
     निकल
    0.52
    Act Density 0.002%

    No Known Activations