INDEX
Explanations
The neuron responds to structural or formatting markers—punctuation (colons, commas, quotes, parentheses), numeric tokens (years, numbers), and similar layout cues.
New Auto-Interp
Negative Logits
-db
-0.06
Down
-0.06
Nam
-0.06
Containers
-0.06
Mic
-0.06
Instr
-0.06
looming
-0.06
Antique
-0.06
asshole
-0.06
Meh
-0.06
POSITIVE LOGITS
verdade
0.07
ность
0.07
frac
0.06
peu
0.06
็บ
0.06
عة
0.06
Connection
0.06
quad
0.06
ัตร
0.06
۱۹۷
0.06
Activations Density 0.184%