INDEX
Explanations
looking down
The neuron is detecting special control or markup tokens (e.g. header/start-of-text markers) rather than normal words.
New Auto-Interp
Negative Logits
Password
-0.06
(pop
-0.06
neurotrans
-0.06
ofday
-0.06
ass
-0.06
rends
-0.06
Summon
-0.06
свид
-0.06
Jay
-0.06
oben
-0.06
POSITIVE LOGITS
/min
0.07
ческой
0.07
Networking
0.07
_shader
0.06
implementing
0.06
ment
0.06
Auss
0.06
noe
0.06
timeofday
0.06
обращ
0.06
Activations Density 0.006%