INDEX
    Explanations

    names of people and entities in news or academic contexts

    New Auto-Interp
    Negative Logits
    æĴ
    -0.15
    aña
    -0.14
    adge
    -0.14
     DSM
    -0.14
    åı
    -0.14
    å»·
    -0.13
     Roths
    -0.13
     Wenger
    -0.13
    obel
    -0.13
     Schmidt
    -0.13
    POSITIVE LOGITS
    pps
    0.18
    ppy
    0.15
     "-//
    0.15
     ucwords
    0.15
    ipop
    0.15
    rist
    0.14
    [email
    0.14
    arris
    0.14
    arding
    0.14
    /pub
    0.14
    Act Density 0.458%

    No Known Activations