INDEX
    Explanations

    mentions of prestigious universities, particularly Yale

    New Auto-Interp
    Negative Logits
    ueur
    -0.17
    estre
    -0.16
    oulouse
    -0.16
    zek
    -0.15
    jsp
    -0.14
    ums
    -0.14
    olare
    -0.14
    idad
    -0.14
    ritis
    -0.14
    antro
    -0.14
    POSITIVE LOGITS
    areth
    0.16
    ighth
    0.16
    ilig
    0.15
    Accessible
    0.14
    quential
    0.14
    ORIZ
    0.14
    ÂŃi
    0.14
    ãĤ¹ãĤ«
    0.14
    QUI
    0.14
    ilen
    0.13
    Act Density 0.007%

    No Known Activations