Explanations
instances of specific formatting or tags likely related to structured data or headings
Negative Logits
Token       Logit
myſelf      -1.86
Efq         -1.82
―――――       -1.81
Theſe       -1.78
$_"         -1.76
\\          -1.70
^(@)        -1.69
Monfieur    -1.69
(\<         -1.68
Houſe       -1.67
Positive Logits
Token       Logit
,           2.02
<bos>       1.56
.           1.24
            1.15
-           1.10
(           1.10
(           0.99
↵           0.98
and         0.89
:           0.87
Activations Density 0.132%