INDEX
    Explanations

    phrases related to political statements or discussions

    New Auto-Interp
    Negative Logits
     shroud
    -0.79
     couch
    -0.68
     semblance
    -0.67
     Belg
    -0.65
     hemor
    -0.63
     Doodle
    -0.63
     taxp
    -0.62
     guiActiveUnfocused
    -0.62
    entary
    -0.62
     canvas
    -0.62
    POSITIVE LOGITS
    ª
    1.28
    ¹
    1.23
    ¸
    1.10
    ł
    1.08
    IJ
    1.08
    ij
    1.06
    ı
    1.03
    ¤
    1.01
    ¡
    1.00
    ³
    0.99
    Act Density 0.154%

    No Known Activations