INDEX
    Explanations

    content related to violence, crime, and social unrest

    instances of violence and oppression

    Negative Logits
    " congr"              -0.74
    " Tycoon"             -0.69
    "ernaut"              -0.67
    "ellation"            -0.66
    " proponent"          -0.64
    "ultimate"            -0.64
    "framework"           -0.61
    " Ranking"            -0.61
    "inventoryQuantity"   -0.61
    " nutshell"           -0.60
    Positive Logits
    " balcon"             1.03
    " indiscrim"          0.95
    " roofs"              0.86
    " sidewalks"          0.85
    " carts"              0.84
    " etc"                0.83
    " detainees"          0.83
    " throats"            0.79
    " passers"            0.79
    " kitchens"           0.77
    Activation Density: 0.503%

    No Known Activations