INDEX
    Explanations

    proper nouns and locations

    New Auto-Interp
    Negative Logits
     Sins
    -0.71
     Noir
    -0.69
     Preferred
    -0.65
     Scarlet
    -0.63
     Emin
    -0.62
     Attribution
    -0.61
     Ivory
    -0.59
    backer
    -0.58
     Reson
    -0.57
     CPC
    -0.57
    POSITIVE LOGITS
    prising
    1.32
    seless
    1.25
    pperc
    1.17
    nexpected
    1.15
    berman
    1.10
    pees
    1.07
    mber
    1.07
    pee
    1.05
    gly
    1.03
    pport
    1.01
    Act Density 3.324%

    No Known Activations