INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    questions
    -0.07
    assignment
    -0.07
    _STMT
    -0.07
     Blind
    -0.07
     programas
    -0.06
    .Cascade
    -0.06
     Dis
    -0.06
     خد
    -0.06
     خواست
    -0.06
     range
    -0.06
    POSITIVE LOGITS
     synthetic
    0.10
     Synthetic
    0.08
    webElementX
    0.06
    она
    0.06
    .TextView
    0.06
    ienia
    0.06
    мат
    0.06
     twins
    0.06
    إنجليزية
    0.06
     ули
    0.06
    Act Density 0.002%

    No Known Activations