Explanations
This neuron fires on the first token of a new sentence or major section, i.e. on sentence-initial words.
Negative Logits
.sky          -0.07
Lease         -0.07
Size          -0.07
Stan          -0.06
surrogate     -0.06
erialized     -0.06
شي            -0.06
^{[           -0.06
[@            -0.06
address       -0.06
Positive Logits
":"","          0.06
Ông             0.06
(ns             0.06
ImageSharp      0.06
ugging          0.06
.PARAM          0.06
researching     0.06
supplementary   0.06
##_             0.06
postup          0.06
Activations Density 0.032%
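
As a rough illustration of how a density figure like this could be derived, the sketch below assumes that "activation density" means the percentage of tokens on which the neuron's activation exceeds a threshold. That definition, the function name `activation_density`, and the sample data are assumptions made for illustration; they are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumed definition: density = 100 * (active tokens) / (all tokens).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Synthetic data (not real measurements): 32 active tokens out of 100,000.
sample = [1.5] * 32 + [0.0] * 99_968
print(f"{activation_density(sample):.3f}%")  # prints 0.032%
```

Under this assumed definition, a density of 0.032% would correspond to the neuron being active on roughly 32 of every 100,000 tokens.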