INDEX
Explanations
Portions
This neuron detects the word “Portions” (as used in copyright/license headers).
New Auto-Interp
Negative Logits
SWG
-0.06
ctx
-0.06
dap
-0.06
webs
-0.06
chili
-0.06
ertura
-0.06
lub
-0.06
climb
-0.06
editar
-0.06
آنان
-0.06
POSITIVE LOGITS
(cos
0.06
udden
0.06
adiator
0.06
.Minute
0.06
.getChild
0.06
(home
0.06
ewriter
0.06
assets
0.06
■
0.06
undertaken
0.06
Activations Density 0.003%