INDEX
    Explanations

    quotes from notable figures

    New Auto-Interp
    Negative Logits
    arrera
    -0.18
    ÃľR
    -0.16
    ault
    -0.15
    unden
    -0.15
    anlı
    -0.15
    ÄįnÃŃk
    -0.14
    agh
    -0.14
    erah
    -0.14
    uder
    -0.14
    hurst
    -0.14
    POSITIVE LOGITS
    uppe
    0.15
     Hayward
    0.14
    ickers
    0.14
    ATIONAL
    0.14
    Ñĥп
    0.14
     cep
    0.13
     multipart
    0.13
    ajar
    0.13
    ":""
    0.13
    izer
    0.13
    Act Density 0.113%

    No Known Activations