INDEX
Explanations
This neuron does not consistently activate for any particular token sequence and appears not to detect any specific pattern.
New Auto-Interp
Negative Logits
Atlantis
-0.07
Traffic
-0.06
_CPP
-0.06
22
-0.06
ворю
-0.06
canActivate
-0.06
doctrine
-0.06
clared
-0.06
stacles
-0.06
уются
-0.06
POSITIVE LOGITS
innoc
0.07
-mask
0.07
фай
0.07
stitched
0.06
restore
0.06
sounded
0.06
彩
0.06
chiếc
0.06
.....
0.06
جع
0.06
Activations Density 0.025%