INDEX
    Explanations

    social media related information such as blocked accounts, following, and dates

    specific words or phrases related to being blocked or canceled in social media or similar contexts

    New Auto-Interp
    Negative Logits
     scout
    -0.85
    theless
    -0.78
     viz
    -0.72
     dart
    -0.72
     bolt
    -0.70
     cu
    -0.69
     coupled
    -0.69
     dj
    -0.67
     explorer
    -0.67
     paired
    -0.66
    POSITIVE LOGITS
    erity
    1.22
    rupted
    1.08
    izophren
    1.00
    ueless
    0.97
    rupt
    0.96
    seless
    0.95
    untary
    0.95
    secut
    0.94
    ificantly
    0.94
    bitious
    0.93
    Act Density 0.256%

    No Known Activations