INDEX
    Explanations

    phrases or contexts related to political discourse

    New Auto-Interp
    Negative Logits
    çľł
    -0.15
    ATER
    -0.15
    rh
    -0.15
    ÙĨس
    -0.14
    udi
    -0.14
    exampleInput
    -0.14
     Entire
    -0.14
    ãng
    -0.14
    ردÙĩ
    -0.14
    amoto
    -0.13
    POSITIVE LOGITS
     ready
    0.23
    -face
    0.23
    IQUE
    0.19
    .ready
    0.18
    usz
    0.17
    issen
    0.17
    -NLS
    0.17
    ToShow
    0.17
    issement
    0.17
     Ready
    0.16
    Act Density 0.040%

    No Known Activations