INDEX
Explanations
This neuron activates on mentions of initial participation or debut—tokens like “first,” “first-team,” “first-year,” and similar references to coming into the team.
New Auto-Interp
Negative Logits
BFS
-0.07
preempt
-0.07
.Areas
-0.06
ців
-0.06
措施
-0.06
_EL
-0.06
ですが
-0.06
Sp
-0.06
brero
-0.06
startswith
-0.06
POSITIVE LOGITS
lesh
0.07
ikan
0.07
nuevas
0.07
critic
0.06
.Json
0.06
Module
0.06
titulo
0.06
",
0.06
viewController
0.06
mesma
0.06
Activations Density 0.004%