INDEX
    Explanations

    references to academic titles and affiliations

    New Auto-Interp
    Negative Logits
    anela
    -0.17
    _dispatch
    -0.15
    isted
    -0.15
    PCA
    -0.15
     Trident
    -0.14
     Watson
    -0.14
     Straw
    -0.14
     Ranger
    -0.14
    acea
    -0.14
    ld
    -0.14
    POSITIVE LOGITS
     Sunny
    0.28
     Sick
    0.27
     Cum
    0.19
     sick
    0.18
     Sinai
    0.17
    ICES
    0.17
     CI
    0.17
    Cum
    0.16
    .ci
    0.16
     Schul
    0.16
    Act Density 0.007%

    No Known Activations