Explanations
conjunctions that indicate a connection or continuation between ideas
Negative Logits
Token    Value
ses      -0.28
↵        -0.21
nt       -0.20
/her     -0.18
?s       -0.17
ÂŃs      -0.17
%s       -0.17
’t       -0.16
-t       -0.16
\s       -0.15
Positive Logits
Token    Value
amp      0.76
nbsp     0.48
AMP      0.45
quot     0.42
raquo    0.42
ÑĶм      0.38
apos     0.38
ï¸ı      0.30
amp      0.30
emsp     0.29
Activations Density 0.064%