    Negative Logits
    Token                 Logit
    " cumbersome"         -0.08
    (token not shown)     -0.08
    " принадлеж"          -0.08
    " шам"                -0.07
    (token not shown)     -0.07
    " futur"              -0.07
    " banging"            -0.07
    " перек"              -0.07
    " сатып"              -0.07
    " bumili"             -0.07
    Positive Logits
    Token           Logit
    " Avg"          0.10
    " conceded"     0.09
    " suffered"     0.09
    "Avg"           0.08
    " Nassau"       0.08
    "occur"         0.08
    "avg"           0.08
    " donkey"       0.08
    " Patent"       0.08
    " seasoned"     0.08
    Activation Density: 0.009%

    No Known Activations