INDEX
    Explanations

    dates, numbers, locations, and rankings within texts

    New Auto-Interp
    Negative Logits
     glim
    -0.79
    othe
    -0.73
     needle
    -0.66
     shar
    -0.66
     polic
    -0.63
    ople
    -0.63
     serv
    -0.63
     corrid
    -0.62
     ranc
    -0.62
    omorph
    -0.60
    POSITIVE LOGITS
    2008
    1.75
     1995
    1.73
     2009
    1.73
    2010
    1.73
    2009
    1.72
     2010
    1.72
     1997
    1.72
     2008
    1.71
     1998
    1.71
     2005
    1.71
    Act Density 0.084%

    No Known Activations