INDEX
Explanations
code and URLs
This neuron almost never activates (activation density 0.005%); it is essentially a “dead” neuron that detects no discernible pattern.
Negative Logits
ráp
-0.08
уки
-0.07
misinformation
-0.07
[of
-0.06
AMP
-0.06
كار
-0.06
_Con
-0.06
inox
-0.06
chains
-0.06
№№
-0.06
Positive Logits
Laud
0.07
مواط
0.06
modele
0.06
برگزار
0.06
.Deserialize
0.06
første
0.06
(eventName
0.06
/{
0.06
車
0.06
orrar
0.06
Activations Density 0.005%
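The dashboard does not say how the density figure is computed. A minimal sketch, assuming it is the share of sampled tokens on which the neuron's activation exceeds zero, expressed as a percentage. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of sampled tokens with a
    nonzero (above-threshold) activation. The source does not define it.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical sample: one active token out of 20,000.
acts = [0.0] * 19_999 + [0.3]
density = activation_density(acts)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.005% corresponds to roughly one active token per 20,000 sampled, which is consistent with describing the neuron as nearly dead rather than strictly never firing.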