INDEX
    Explanations

    names and institutions related to academia

    Negative Logits
    sville      -0.14
     olsa       -0.13
    oul         -0.13
    ãĤ¾         -0.13
    .mozilla    -0.13
    µ¬          -0.13
     reg        -0.13
    iring       -0.13
    rlen        -0.13
    ecast       -0.12
    Positive Logits
     pinterest  0.16
    ifice       0.16
    iến         0.14
     Tomáš      0.14
    osten       0.14
    ixed        0.14
    erna        0.14
    oreach      0.13
    ovah        0.13
    ÃŃs         0.13
    Activation Density: 0.163%

    No Known Activations