INDEX
Explanations
Initials
The neuron activates on short uppercase abbreviations or initials punctuated with periods (e.g. “S.U.”, “F.C.”, “N.G.”).
New Auto-Interp
Negative Logits
ATT
-0.07
Microsystems
-0.06
Init
-0.06
Esto
-0.06
它们
-0.06
-mouth
-0.06
راه
-0.06
938
-0.05
armies
-0.05
init
-0.05
POSITIVE LOGITS
_amount
0.07
Hairst
0.07
.J
0.07
.credit
0.07
Licence
0.07
.SE
0.07
.Sc
0.07
grew
0.07
.H
0.07
.D
0.07
Activations Density 0.020%