Explanations
instances of brackets or curly braces in the text
Negative Logits
él          -0.16
ppers       -0.15
odka        -0.14
aste        -0.14
émon        -0.14
zburg       -0.14
Katy        -0.14
ewn         -0.14
ót          -0.14
_accessor   -0.14
Positive Logits
Score       0.18
advanced    0.18
scores      0.16
scored      0.16
score       0.16
Scores      0.16
Score       0.16
SCORE       0.16
advanced    0.16
score       0.15
Activations Density 0.000%