    NEGATIVE LOGITS
    "Details"          -0.07
    " Das"             -0.07
    " userDao"         -0.06
    "λαμβ"             -0.06
    "Feel"             -0.06
    " Wohnung"         -0.06
    (token missing)    -0.06
    " entertainment"   -0.06
    " χρη"             -0.06
    " Evo"             -0.06
    POSITIVE LOGITS
    "ifa"              0.07
    " xmax"            0.06
    ".failure"         0.06
    "ия"               0.06
    "ожд"              0.06
    "(employee"        0.06
    ",Yes"             0.06
    "мон"              0.06
    "apses"            0.06
    "ackages"          0.06
    Act Density 0.027%

    No Known Activations