INDEX
Explanations
The neuron activates on proper names and branded sports entities, e.g. player names, team names, and similar named-entity tokens.
Negative Logits
liquids       -0.07
              -0.07
yx            -0.07
602           -0.07
phis          -0.07
сол           -0.07
بازار         -0.07
iks           -0.06
_extractor    -0.06
igma          -0.06
Positive Logits
acknowledges  0.06
.arguments    0.06
велич         0.06
//$           0.06
(category     0.06
좌            0.06
anneer        0.06
.permissions  0.06
/bootstrap    0.06
tparam        0.06
Activations Density 0.032%
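As a rough illustration of what a density figure like the one above could mean, here is a minimal sketch that assumes density is the fraction of tokens on which the neuron's activation exceeds a threshold. That definition, the function name activation_density, the threshold parameter, and the sample data are all assumptions for illustration, not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumed definition: density = (tokens that fire) / (tokens seen).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 tokens, of which 3 have a nonzero activation.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.032% would correspond to the neuron firing on roughly 3 out of every 10,000 tokens.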