INDEX
    Explanations

    references to a specific person, specifically "Sanders."

    mentions of the name "Sanders."

    New Auto-Interp
    Negative Logits
    ocaust
    -0.87
    ãĥ¼ãĥĨãĤ£
    -0.78
    obar
    -0.77
     tyr
    -0.74
    obe
    -0.74
    othy
    -0.72
    ãĥ¼ãĥĨ
    -0.71
    ogly
    -0.71
    ————
    -0.70
    onut
    -0.69
    POSITIVE LOGITS
     Sanders
    1.12
     Supporters
    1.01
     supporters
    0.95
    Sanders
    0.92
     Caucus
    0.84
     supporter
    0.82
     delegates
    0.81
    '
    0.79
     Bros
    0.79
    rade
    0.77
    Act Density 0.020%

    No Known Activations