INDEX
    Explanations

    discussion about social and political issues, such as feminism, climate change, social security, terrorism, and activism

    New Auto-Interp
    Negative Logits
     yacht
    -0.96
    Ô
    -0.95
     fortun
    -0.94
    è£ħ
    -0.91
    OUGH
    -0.91
    VERTISEMENT
    -0.90
    Detailed
    -0.90
    Angelo
    -0.88
    Berry
    -0.88
     Yose
    -0.88
    POSITIVE LOGITS
    might
    0.99
    hop
    0.98
    should
    0.97
    stood
    0.94
     embodied
    0.90
    walking
    0.89
    lead
    0.88
    uit
    0.88
    ãĤ±
    0.86
    sheets
    0.86
    Act Density 1.630%

    No Known Activations