INDEX
    Explanations

    phrases related to activism and social causes

    repeated phrases or expressions that emphasize a sentiment or idea indicating frustration or discontent

    Negative Logits
    Token           Logit
    " adolesc"      -0.82
    " mathemat"     -0.81
    " hemor"        -0.80
    "anium"         -0.79
    " Seym"         -0.78
    " imitation"    -0.74
    " fortun"       -0.74
    "oscope"        -0.74
    " captives"     -0.72
    " Palestin"     -0.71
    Positive Logits
    Token           Logit
    "ï¸"            1.02
    "ï¸ı"           0.90
    "Balt"          0.82
    "own"           0.78
    "Ru"            0.77
    "nder"          0.76
    "¯"             0.75
    "女"            0.74
    "ishable"       0.73
    "wise"          0.73
    Activation Density: 0.231%

    No Known Activations