    NEGATIVE LOGITS
    Token           Logit
    "xis"           -0.07
    " Pan"          -0.07
    "-type"         -0.07
    " minWidth"     -0.06
    "TX"            -0.06
    "581"           -0.06
    " Anch"         -0.06
    "gun"           -0.06
    "_specific"     -0.06
    " inside"       -0.06

    POSITIVE LOGITS
    Token           Logit
    " о"            0.08
    " معت"          0.07
    "isini"         0.07
    "تغ"            0.07
    "obia"          0.07
    " PIO"          0.07
    " nghỉ"         0.07
    " معل"          0.07
    " për"          0.07
    "horia"         0.06

    Activation density: 0.011%

    No Known Activations