INDEX
Explanations
This neuron never activates—it doesn’t respond to any tokens.
New Auto-Interp
Negative Logits
Bear
-0.06
وئ
-0.06
puppet
-0.06
snake
-0.06
키
-0.06
ンズ
-0.06
Ticker
-0.06
_sms
-0.06
Typ
-0.06
starred
-0.06
POSITIVE LOGITS
UniformLocation
0.07
moveTo
0.07
.feedback
0.06
ACK
0.06
(bundle
0.06
_define
0.06
commodities
0.06
.scope
0.06
Lemma
0.06
NavLink
0.06
Activations Density 0.004%