INDEX
Explanations
phrases related to a sense of discontent with societal issues
Tokens appearing before usernames or signatures
user names or identifiers following "by"
Negative Logits
UIControlState   -0.86
poffe            -0.79
uſed             -0.76
بوابة            -0.74
ſtand            -0.73
fhew             -0.72
ArgsConstructor  -0.71
ſelf             -0.71
pleaſure         -0.71
diſt             -0.70
Positive Logits
@          0.67
Anonymous  0.66
Mr         0.66
j          0.64
Mr         0.61
anonymous  0.61
Anonymous  0.57
k          0.56
anon       0.56
@          0.54
Activations Density: 0.175%