INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     needy
    -0.08
     معامل
    -0.07
    чысы
    -0.07
    ורך
    -0.07
    important
    -0.07
     policym
    -0.07
    ighthouse
    -0.07
    -important
    -0.07
    -0.07
     playful
    -0.07
    POSITIVE LOGITS
     completed
    0.08
     Bắc
    0.08
    0.08
     mother's
    0.08
     ગયા
    0.08
     Boulevard
    0.07
    .completed
    0.07
    .Completed
    0.07
     creciendo
    0.07
    0.07
    Act Density 0.093%

    No Known Activations