Explanations
patterns of repeated characters or symbols in sequences
repeated sequences or patterns in the text
Negative Logits
-1.20
-1.02
-0.84
-0.82
”
-0.82
)
-0.79
-0.78
?
-0.78
-0.78
-0.77
Positive Logits
0.87
photolibrary
0.82
0.75
↵
0.71
0.68
myſelf
0.66
ARXIV
0.65
↵↵
0.64
defaultstate
0.63
0.61
Activations Density: 1.298%