INDEX
Explanations
quotation marks and punctuation surrounding quotes
punctuation marks, particularly quotes
quotation marks/apostrophes
New Auto-Interp
Negative Logits
’”
-1.23
,’”
-1.22
.’”
-1.20
’).
-1.15
’,
-1.14
”…
-1.14
’.”
-1.13
’)
-1.13
.”)
-1.13
’:
-1.13
POSITIVE LOGITS
"
0.94
'
0.91
'
0.78
"
0.75
"'
0.60
"'
0.56
-'
0.49
--"
0.46
--
0.44
..."
0.44
Activations Density 1.192%