INDEX
    Explanations

    mentions related to societal or political issues within a country

    topics related to societal issues and injustices

    Negative Logits
    Token         Logit
    iago          -0.73
    urat          -0.73
    escription    -0.72
    inyl          -0.69
    letes         -0.67
    uilt          -0.60
    redit         -0.59
     Yorkers      -0.57
     Tycoon       -0.57
    POR           -0.56
    Positive Logits
    Token         Logit
     circles      0.86
     hierarchy    0.79
    manship       0.79
     classrooms   0.78
     contexts     0.76
     context      0.76
     sphere       0.75
     lately       0.71
     today        0.69
    ament         0.68
    Activation Density: 0.446%

    No Known Activations