INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     unos
    -0.07
     saving
    -0.06
     Filtering
    -0.06
     sür
    -0.06
     yapılmış
    -0.06
     jméno
    -0.06
     weiber
    -0.06
     глуб
    -0.06
    (stmt
    -0.06
    .MATCH
    -0.06
    POSITIVE LOGITS
     TKey
    0.07
    dění
    0.06
    �u
    0.06
     legislature
    0.06
    swagen
    0.06
    wu
    0.06
     adolescents
    0.06
    0.06
    0.06
    istar
    0.06
    Act Density 0.013%

    No Known Activations