INDEX
Explanations
This neuron never activates—that is, it does not respond to any tokens.
New Auto-Interp
Negative Logits
ezier
-0.06
_detected
-0.06
卖
-0.06
champagne
-0.06
quential
-0.06
/mit
-0.06
lications
-0.06
idols
-0.06
comparator
-0.06
衝
-0.06
POSITIVE LOGITS
_stand
0.07
áři
0.07
IOC
0.07
.New
0.06
{})0.06
пред
0.06
{}]0.06
odpově
0.06
sæ
0.06
#{0.06
Activations Density 0.008%