INDEX
    Explanations

    dates in the format "December [day]"

    New Auto-Interp
    Negative Logits
    aird
    -0.79
     juggling
    -0.71
    sterdam
    -0.71
    yright
    -0.69
     Ital
    -0.67
    zee
    -0.64
    geries
    -0.63
    XY
    -0.62
    cher
    -0.62
    enh
    -0.61
    POSITIVE LOGITS
    EMBER
    0.97
     2011
    0.94
     1941
    0.93
     2012
    0.93
     2013
    0.93
     2015
    0.92
     2010
    0.91
     2014
    0.88
     2009
    0.86
     2006
    0.86
    Act Density 0.024%

    No Known Activations