INDEX
    Explanations

    expressions of social and political opposition or conflict

    New Auto-Interp
    Negative Logits
    onus
    -0.16
    CW
    -0.15
    ehen
    -0.14
    oldur
    -0.14
    pcodes
    -0.14
    uchar
    -0.14
    cpt
    -0.14
    IntPtr
    -0.14
    ritic
    -0.14
    eniable
    -0.14
    POSITIVE LOGITS
     carriers
    0.17
     spreading
    0.15
     Huntington
    0.15
     ide
    0.15
     Band
    0.15
    æĦıè¯Ĩ
    0.15
     Wah
    0.15
    pseudo
    0.15
     Macron
    0.14
     tuyên
    0.14
    Act Density 0.004%

    No Known Activations