INDEX
Explanations
punctuation
The neuron activates on tokens that mark dates, years, and other numerical career‐timeline references in a biography.
New Auto-Interp
Negative Logits
Cos
-0.07
〈
-0.07
icina
-0.07
Emp
-0.06
uit
-0.06
NK
-0.06
/mit
-0.06
ikes
-0.06
illas
-0.06
Thursday
-0.06
POSITIVE LOGITS
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
0.07
247
0.07
[dim
0.06
Warn
0.06
getState
0.06
=\"$
0.06
<class
0.06
window
0.06
.savefig
0.06
ίναι
0.06
Activations Density 0.042%