INDEX
    Explanations

    phrases related to civil unrest or societal issues of injustice

    Negative Logits
    oka                 -0.69
    0000000000000000    -0.68
    acles               -0.66
    utenberg            -0.63
    Password            -0.62
    ropri               -0.62
    resy                -0.61
    Enlarge             -0.61
    arten               -0.59
    ially               -0.59
    Positive Logits
    grown               0.79
    country             0.79
    drive               0.74
     again              0.73
    iflower             0.73
    lake                0.72
    roads               0.72
    sites               0.71
    reach               0.69
     town               0.69
    Act Density 0.023%

    No Known Activations