INDEX
    Explanations

    references to specific groups or communities

    New Auto-Interp
    Negative Logits
    alion
    -0.19
     ãģ¿
    -0.16
    .nlm
    -0.16
    utom
    -0.15
     INCIDENT
    -0.15
    unde
    -0.15
    wish
    -0.14
    ayıp
    -0.14
    Ðĩ
    -0.14
    Ïģον
    -0.14
    POSITIVE LOGITS
     protest
    0.17
     vent
    0.17
     say
    0.17
     increasingly
    0.17
     Mull
    0.17
     urged
    0.16
     React
    0.16
     shouldn
    0.16
     face
    0.16
    React
    0.15
    Act Density 0.145%

    No Known Activations