Explanations
The neuron fires on numeric tokens, especially the digit “4” in contexts such as “last 4 digits.”
Negative Logits
nému        -0.07
payload     -0.07
poc         -0.06
infection   -0.06
ran         -0.06
Ruf         -0.06
мон         -0.06
нею         -0.06
oo          -0.06
coppia      -0.06
Positive Logits
(['/        0.06
eski        0.06
楽          0.06
pylint      0.06
MX          0.06
compounded  0.06
express     0.05
eventName   0.05
JSONObject  0.05
قن          0.05
Activations Density 0.004%