INDEX
Explanations
academic texts
This neuron detects structured outline headings, particularly the numbered modules, lessons, and section identifiers in a training or course index.
Negative Logits
warming      -0.07
Agent        -0.06
-prop        -0.06
IFA          -0.06
-H           -0.06
�            -0.06
BP           -0.06
Mini         -0.06
character    -0.06
utoff        -0.06
Positive Logits
???↵↵        0.07
391          0.07
그는         0.07
);//         0.07
appending    0.07
byl          0.06
Guidelines   0.06
.....↵↵      0.06
blames       0.06
RTE          0.06
Activations Density: 0.135%