INDEX
    Explanations

    references to specific political figures and entities

    references to significant events and figures related to political resistance and protests

    New Auto-Interp
    Negative Logits
    phal
    -0.84
    ties
    -0.80
    loving
    -0.80
    lishes
    -0.78
    marine
    -0.77
    ty
    -0.76
     Spit
    -0.75
    friend
    -0.73
    win
    -0.73
    trap
    -0.71
    POSITIVE LOGITS
     Lerner
    0.77
     Standing
    0.75
    imester
    0.74
     Garland
    0.72
    orsi
    0.71
     hearings
    0.70
    iffs
    0.70
    ón
    0.69
    ograp
    0.68
     pollen
    0.68
    Act Density 0.019%

    No Known Activations