INDEX
Explanations
repeated punctuation or structural markers in text
New Auto-Interp
Negative Logits
sprung
-0.80
20439
-0.73
æ©
-0.68
ãĥ¼ãĥĨ
-0.64
*/(
-0.64
ciation
-0.64
enta
-0.63
overd
-0.63
nodd
-0.63
bledon
-0.61
POSITIVE LOGITS
.
1.27
._
1.03
..
1.02
.)
0.94
*.
0.94
."
0.90
-.
0.86
]
0.85
(.
0.81
..................
0.80
Activations Density 0.003%