INDEX
Explanations
no clear pattern
structural formatting cues indicating lists and outlines, such as section headers, numbered items, and bullet-point subpoints.
New Auto-Interp
Negative Logits
checksum
0.19
dimensionality
0.18
minimise
0.17
splatter
0.17
minimising
0.17
accum
0.16
minimization
0.16
astral
0.16
ambiguity
0.16
imbalance
0.16
POSITIVE LOGITS
0.19
0.17
0.17
0.17
0.16
0.16
0.16
0.15
0.15
0.15
Activations Density 0.675%