INDEX
Explanations
This neuron doesn’t respond to any input—it remains inactive for all tokens.
New Auto-Interp
Negative Logits
佐
-0.08
_from
-0.07
PRI
-0.07
Outs
-0.07
freeway
-0.06
ころ
-0.06
λεύ
-0.06
بشر
-0.06
правиль
-0.06
iterated
-0.06
POSITIVE LOGITS
ños
0.06
bourgeoisie
0.06
picking
0.06
다양한
0.06
IKE
0.06
şans
0.06
.gson
0.06
’s
0.06
-spe
0.06
etSocketAddress
0.06
Activations Density 0.117%