INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    izabeth
    -0.71
    uden
    -0.70
    cedes
    -0.69
    igham
    -0.68
    cellence
    -0.66
    ioch
    -0.65
    roo
    -0.65
    amines
    -0.64
    ifles
    -0.62
    speak
    -0.60
    POSITIVE LOGITS
     2015
    0.95
     2016
    0.93
     2017
    0.90
    flower
    0.87
     2014
    0.87
     2013
    0.85
     deadline
    0.85
     edition
    0.82
     2012
    0.82
     2011
    0.80
    Act Density 0.047%

    No Known Activations