INDEX
    Explanations
    Negative Logits
     میان
    -0.07
     defaultdict
    -0.07
     عاش
    -0.06
     WebDriverWait
    -0.06
     similarities
    -0.06
    组织
    -0.06
    کس
    -0.06
    Member
    -0.06
     give
    -0.06
    bots
    -0.06
    Positive Logits
    AVOR
    0.07
    calculator
    0.07
     spared
    0.07
     figured
    0.07
    logged
    0.06
    /world
    0.06
     있음
    0.06
     ROUT
    0.06
    indrical
    0.06
     الطبي
    0.06
    Act Density 0.171%

    No Known Activations