INDEX
Explanations
language indicating errors or issues
instances of the word "error" and its variations in the text
Negative Logits
APTER      -0.85
apeake     -0.76
nai        -0.76
amen       -0.74
electric   -0.73
±          -0.72
idal       -0.71
bors       -0.70
apy        -0.70
bledon     -0.67
Positive Logits
error         0.99
guiActiveUn   0.99
ously         0.96
errors        0.85
Error         0.80
margin        0.76
deceive       0.75
gered         0.74
offend        0.72
error         0.71
Activations Density: 0.019%