INDEX
Explanations
mentions of the term "Anonymous" or username-related content
instances of the word "Anonymous" and related terms
New Auto-Interp
Negative Logits
eele
-0.81
=-=-=-=-=-=-=-=-
-0.75
Gork
-0.74
asters
-0.70
rest
-0.70
++++++++++++++++
-0.69
efully
-0.66
tsky
-0.65
orses
-0.64
enegger
-0.64
POSITIVE LOGITS
Anonymous
0.81
ica
0.72
Anonymous
0.71
onymous
0.69
obliged
0.67
hacker
0.67
Warfare
0.66
ãĥĩ
0.65
cott
0.64
uthor
0.63
Activations Density 0.012%