INDEX
    Explanations

    phrases related to a specific person

    mentions of specific individuals, particularly related to sports

    New Auto-Interp
    Negative Logits
     Catalyst
    -0.68
     replay
    -0.68
     Marketable
    -0.61
     judgment
    -0.61
     misinformation
    -0.61
    lihood
    -0.60
    ++++++++++++++++
    -0.60
    mble
    -0.60
    Journal
    -0.59
     Grateful
    -0.59
    POSITIVE LOGITS
     Poc
    1.10
    oco
    0.98
     Hots
    0.98
    hett
    0.97
    otte
    0.94
    atell
    0.90
    het
    0.87
    rolet
    0.83
    jas
    0.82
    seys
    0.82
    Act Density 0.006%

    No Known Activations