Explanations
This neuron almost never activates (activation density 0.058%). It does not respond consistently to any token and is essentially “dead”.
Negative Logits
ايران    -0.07
组织    -0.06
newbie    -0.06
_extend    -0.06
itbart    -0.06
міну    -0.06
believed    -0.06
smugg    -0.06
jím    -0.06
converted    -0.06
Positive Logits
controle    0.07
možnost    0.06
(priority    0.06
numeral    0.06
mimetype    0.06
Manning    0.06
Senators    0.06
itt    0.06
JL    0.06
Đối    0.06
Activation Density: 0.058%
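The density figure can be read as the fraction of sampled tokens on which the neuron's activation is above zero. The page does not state its exact definition, so the sketch below assumes that reading. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical, chosen only for illustration.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens with activation > threshold.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, only 6 of which activate the neuron.
sample = [0.0] * 9994 + [0.3, 1.2, 0.1, 0.8, 0.05, 2.0]
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low, together with top logit effects of only about ±0.06 to ±0.07 spread over unrelated tokens, is consistent with the "dead" description: the neuron rarely fires and, when it does, barely shifts the output distribution.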