INDEX
    Explanations

    words related to social and political issues, including oppression, defiance, and activism

    New Auto-Interp
    Negative Logits
    abase
    -0.86
    wikipedia
    -0.76
    baum
    -0.74
    zzo
    -0.73
    ĸļ
    -0.72
    atari
    -0.71
    Base
    -0.71
     Keys
    -0.70
    lear
    -0.70
    eport
    -0.70
    POSITIVE LOGITS
     bloodshed
    1.53
     mayhem
    1.41
     persecution
    1.39
     injustice
    1.37
     instability
    1.35
     violence
    1.35
     oppression
    1.34
     repression
    1.33
     vandalism
    1.32
     strife
    1.32
    Act Density 0.236%

    No Known Activations