INDEX
    Explanations

    years or dates in a specific format

    negative expressions related to sports seasons and performance metrics

    NEGATIVE LOGITS
    Token          Logit
    " Zoro"        -0.71
    " Dickinson"   -0.69
    "ensor"        -0.68
    " scanner"     -0.65
    " Harmon"      -0.65
    " thumbs"      -0.64
    " Carrier"     -0.63
    " Gim"         -0.62
    " moder"       -0.62
    " Feather"     -0.61
    POSITIVE LOGITS
    Token          Logit
    "2014"         1.18
    "2016"         1.18
    "2012"         1.17
    "2017"         1.16
    "2020"         1.14
    "2011"         1.12
    "present"      1.11
    "2013"         1.11
    "2018"         1.11
    "2015"         1.10
    Act Density 0.037%

    No Known Activations