INDEX
    Explanations

    references to individuals and their achievements or notable traits

    New Auto-Interp
    Negative Logits
    ober
    -0.15
    ubern
    -0.15
    USTER
    -0.14
    bron
    -0.14
    antz
    -0.14
     Bowling
    -0.14
     Mare
    -0.13
    teri
    -0.13
    bern
    -0.13
    irst
    -0.13
    POSITIVE LOGITS
    nt
    0.16
    nu
    0.15
    ona
    0.14
    /manage
    0.14
    PFN
    0.14
     Kelley
    0.14
     donc
    0.14
    ortho
    0.14
    removeAttr
    0.13
    zsche
    0.13
    Act Density 0.095%

    No Known Activations