INDEX
    Explanations

    phrases related to social issues and conflicts

    New Auto-Interp
    Negative Logits
    reon
    -0.70
     largeDownload
    -0.68
    chwitz
    -0.67
    arching
    -0.65
     Rousse
    -0.63
    ilee
    -0.62
    ¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯
    -0.62
    ophen
    -0.61
     Indra
    -0.60
    VERTISEMENT
    -0.59
    POSITIVE LOGITS
    pox
    1.29
     (<
    0.98
     increments
    0.92
     consolation
    0.90
     tweaks
    0.88
    atur
    0.87
     insignificant
    0.86
     fry
    0.85
     handful
    0.84
     incremental
    0.83
    Act Density 0.646%

    No Known Activations