INDEX
Explanations
The neuron fires on standalone numeric tokens—particularly those denoting years, volume/page numbers, and other citation‐style numerals.
New Auto-Interp
Negative Logits
уч
-0.07
_lc
-0.07
alt
-0.07
accumulate
-0.07
_SOURCE
-0.07
رشد
-0.06
hare
-0.06
모두
-0.06
tín
-0.06
Reading
-0.06
POSITIVE LOGITS
Put
0.07
خاب
0.06
يار
0.06
WP
0.06
righteous
0.06
nection
0.06
metics
0.06
tattoo
0.06
ини
0.06
ительные
0.06
Activations Density 0.014%