INDEX
    Explanations

    comments related to sports or performance, particularly focused on assessing player performance

    instances of the word "play" in various contexts, particularly in reference to sports or performance

    Negative Logits
    " Beir"          -0.67
    " ren"           -0.66
    " 9000"          -0.62
    " Shipping"      -0.58
    " Reich"         -0.58
    "ITED"           -0.57
    " ink"           -0.57
    " tow"           -0.57
    " campaigned"    -0.57
    " relinqu"       -0.57

    Positive Logits
    "wright"         1.37
    "ername"         1.29
    "style"          1.25
    "calling"        1.24
    "maker"          1.19
    "offs"           1.18
    "styles"         1.15
    "making"         1.15
    "testing"        1.14
    "makers"         1.13

    Act Density 0.042%

    No Known Activations