    NEGATIVE LOGITS
                -0.07
     atan       -0.06
     Kansas     -0.06
     slick      -0.06
     Articles   -0.06
    .Empty      -0.06
     безпеки    -0.06
    TOR         -0.06
    345         -0.06
     Shades     -0.06
    POSITIVE LOGITS
     Kathleen   0.27
     Cynthia    0.20
     Edwin      0.19
    leen        0.17
    ynthia      0.17
     Janet      0.12
     Theresa    0.12
    ileen       0.12
     Patricia   0.12
     Judith     0.11
    Activation Density: 0.004%

    No Known Activations