INDEX
Explanations
similar to, time, increase, how
emphasized or standout key terms and headings in structured instructional text, especially those marked by formatting cues (bold/italics, quotes, slashes, or code-style tokens).
New Auto-Interp
Negative Logits
and
0.29
to
0.28
,
0.28
or
0.25
و
0.25
œuvres
0.25
-
0.25
(
0.24
–
0.24
0.23
POSITIVE LOGITS
<unused1861>
0.28
<unused742>
0.24
<unused2037>
0.23
<unused717>
0.23
their
0.23
<unused321>
0.23
<unused1774>
0.23
<unused661>
0.22
AMP
0.22
<unused1658>
0.22
Activations Density 2.071%