INDEX
    Explanations
    Negative Logits
    Token       Logit
     خل         -0.07
     Kurdish    -0.06
    annotate    -0.06
    shadow      -0.06
     NGX        -0.06
    coni        -0.06
    bilir       -0.06
    gün         -0.06
     Sharia     -0.06
    daq         -0.06
    Positive Logits
    Token       Logit
    (token not shown)  0.06
    ा:          0.06
     DAL        0.06
    (token not shown)  0.06
     cáo        0.06
    版本        0.06
    casts       0.05
    plets       0.05
    ์เน         0.05
    líž         0.05
    Activation Density: 0.275%

    No Known Activations