INDEX
Explanations
punctuation
The neuron primarily activates on numeric tokens (years, numbers, and other digit sequences).
Negative Logits
Token         Logit
.WEST         -0.07
entertaining  -0.07
birth         -0.07
Tam           -0.07
Rivers        -0.07
_DIV          -0.07
<object       -0.06
.Download     -0.06
Y             -0.06
aguay         -0.06
Positive Logits
Token                Logit
ática                0.06
-↵                   0.06
reset                0.06
려요                 0.06
textbook             0.06
index                0.06
onCreateOptionsMenu  0.06
cream                0.06
WindowState          0.05
/)↵                  0.05
Activations Density: 0.056%