INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    (X
    -0.07
     grid
    -0.06
     target
    -0.06
     resource
    -0.06
     новые
    -0.06
    ождение
    -0.06
     little
    -0.06
     Browns
    -0.06
    _dates
    -0.06
     buyer
    -0.06
    POSITIVE LOGITS
    semester
    0.08
     Seminar
    0.08
     همسر
    0.08
     SEM
    0.08
    eni
    0.08
     seminar
    0.08
     شیر
    0.07
     науков
    0.07
     semi
    0.07
    ismet
    0.07
    Act Density 0.009%

    No Known Activations