    Negative Logits
    Token          Logit
    "vana"         -0.80
    "leness"       -0.76
    "ONY"          -0.73
    "otal"         -0.64
    "selection"    -0.63
    "ItemImage"    -0.63
    "flush"        -0.63
    " Cameroon"    -0.63
    "lihood"       -0.62
    "ensed"        -0.62
    Positive Logits
    Token          Logit
    " DC"           1.17
    "sonian"        1.09
    " Post"         1.08
    " Capitals"     1.03
    " Heights"      1.02
    " Dull"         1.01
    " Redskins"     0.94
    " Examiner"     0.94
    " Irving"       0.94
    " Nationals"    0.91
    Activation Density: 0.030%

    No Known Activations