INDEX
    Explanations

    specifically named authors in academic paper citations

    references to authors or collaborators in cited academic work

    New Auto-Interp
    Negative Logits
    OUNT
    -0.78
    canon
    -0.75
    FUL
    -0.73
    esses
    -0.73
    rontal
    -0.69
    OWS
    -0.67
    Meat
    -0.66
    ACP
    -0.63
    ardless
    -0.63
    IFT
    -0.61
    POSITIVE LOGITS
     seq
    1.21
    rics
    0.92
     al
    0.89
    ween
    0.82
    sis
    0.81
    ree
    0.81
    hetically
    0.80
     Associates
    0.74
    lace
    0.74
    iated
    0.73
    Act Density 0.011%

    No Known Activations