INDEX
Explanations
instances of punctuation, particularly commas and quotation marks
Negative Logits
Token          Logit
significant    -0.39
,              -0.38
↵              -0.38
and            -0.34
↵↵             -0.34
Reif           -0.34
SEVER          -0.33
’              -0.32
severe         -0.30
'              -0.29
Positive Logits
Token          Logit
),”            1.23
.’”            1.23
,’”            1.22
?”             1.20
?”.            1.19
...”           1.17
).”            1.17
,”             1.17
,'"            1.17
.”             1.16
Activations Density: 0.197%