INDEX
    Explanations

    mentions of writers and writing

    Negative Logits
    anter      -0.08
    ERT        -0.08
    eur        -0.08
    edu        -0.08
    tep        -0.08
    ayers      -0.08
    tsy        -0.07
    ilion      -0.07
    adera      -0.07
    ÑĤим       -0.07
    Positive Logits
    hip        0.08
    hood       0.07
    /editor    0.07
    innen      0.07
    /auth      0.07
    itative    0.06
    prene      0.06
    lady       0.06
    /art       0.06
     Indies    0.06
    Activation Density: 0.012%

    No Known Activations