Explanations
Years and awards
The neuron selectively responds to numeric tokens—especially years—in the text.
Negative Logits
Token         Logit
_impl         -0.06
_Level        -0.06
Nothing       -0.06
Ella          -0.06
Finance       -0.06
Update        -0.06
'\"           -0.06
barley        -0.06
(schedule     -0.06
UserDefaults  -0.06
Positive Logits
Token         Logit
catalogs      0.07
#w            0.07
Ülke          0.07
Conn          0.06
lined         0.06
celebrated    0.06
۱۹۶           0.06
.ensure       0.06
_child        0.06
!");↵↵        0.06
Activations Density: 0.012%