INDEX
    Explanations

    references to scientific researchers and their contributions

    Negative Logits
    Token       Logit
    "itis"      -0.07
    "atee"      -0.06
    "counter"   -0.06
    "OUNTER"    -0.06
    "fat"       -0.06
    "ot"        -0.06
    "CHO"       -0.06
    " posts"    -0.06
    "ounter"    -0.06
    "val"       -0.06
    Positive Logits
    Token       Logit
    " lead"     0.14
    "lead"      0.11
    " Lead"     0.10
    "Lead"      0.10
    " co"       0.10
    "_lead"     0.09
    " authors"  0.08
    "rray"      0.08
    "ahren"     0.08
    "olley"     0.08
    Activation Density: 0.010%

    No Known Activations