INDEX
    Explanations

    specific entities, mostly related to incidents or actions taken against them

    references to personal experiences or identities of individuals

    New Auto-Interp
    Negative Logits
     Ital
    -0.67
     Cards
    -0.67
     Readers
    -0.65
     Remastered
    -0.64
     Legend
    -0.64
     VIDEOS
    -0.63
     Retirement
    -0.63
     Proced
    -0.63
     Rewards
    -0.62
     Words
    -0.61
    POSITIVE LOGITS
    soever
    1.00
     currently
    0.92
     otherwise
    0.88
     supposedly
    0.83
     previously
    0.81
    needed
    0.77
     pesky
    0.76
     belonged
    0.76
     resulted
    0.75
     allegedly
    0.75
    Act Density 0.318%

    No Known Activations