INDEX
Explanations
a specific repeated character or symbol
New Auto-Interp
Negative Logits
abi
-0.16
orde
-0.16
ÑĪÑĤ
-0.15
serter
-0.15
ypad
-0.14
аÑĢаÑĤ
-0.14
ivating
-0.14
oupper
-0.14
ghi
-0.14
Pub
-0.14
POSITIVE LOGITS
Bomb
0.18
.sys
0.17
bomb
0.17
ãĥªãĤ¹
0.16
Bomb
0.16
bomb
0.15
ipt
0.15
Jones
0.15
sleeper
0.15
bombing
0.15
Activations Density 0.010%