INDEX
    Explanations

    phrases related to a sense of discontent with societal issues

    Tokens appearing before usernames or signatures

    user names or identifiers following "by"

    New Auto-Interp
    Negative Logits
    UIControlState
    -0.86
     poffe
    -0.79
     uſed
    -0.76
    بوابة
    -0.74
     ſtand
    -0.73
     fhew
    -0.72
    ArgsConstructor
    -0.71
    ſelf
    -0.71
     pleaſure
    -0.71
     diſt
    -0.70
    POSITIVE LOGITS
     @
    0.67
     Anonymous
    0.66
     Mr
    0.66
     j
    0.64
    Mr
    0.61
     anonymous
    0.61
    Anonymous
    0.57
     k
    0.56
    anon
    0.56
    @
    0.54
    Act Density 0.175%

    No Known Activations