    Negative Logits
    Token         Logit
     joyful       -0.07
     imposition   -0.06
    aşı           -0.06
    تز            -0.06
     Ole          -0.06
    olah          -0.06
    .FIELD        -0.06
    Translation   -0.06
    .Widget       -0.06
    aliz          -0.06
    Positive Logits
    Token         Logit
    _Mode         0.07
     sweat        0.06
    идента        0.06
    -state        0.06
     Language     0.06
    etermined     0.06
    Ğ             0.06
     register     0.06
    чика          0.06
     recycling    0.06
    Activation Density: 0.000%

    No Known Activations