INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    oute
    -0.07
     Global
    -0.07
    ',
    -0.06
    (c
    -0.06
     noting
    -0.06
    (ns
    -0.06
    -store
    -0.06
    lun
    -0.06
    Upper
    -0.06
     oddly
    -0.06
    POSITIVE LOGITS
    σιμο
    0.07
     діяльність
    0.06
    oliday
    0.06
    formace
    0.06
     nonatomic
    0.06
    스토
    0.06
    imbabwe
    0.06
    objs
    0.06
    κού
    0.06
     appare
    0.06
    Act Density 0.024%

    No Known Activations