INDEX
Explanations
Code brackets
This neuron activates on programming‐language syntax and identifiers (i.e. source‐code tokens) rather than natural‐language text.
New Auto-Interp
Negative Logits
lif
-0.06
Downloader
-0.06
wash
-0.06
pres
-0.06
flavours
-0.06
vg
-0.06
rio
-0.06
мощ
-0.06
eres
-0.05
bios
-0.05
POSITIVE LOGITS
Warren
0.08
AttributeSet
0.07
sum
0.07
نا
0.07
ему
0.07
添加
0.07
بد
0.07
taxable
0.07
میل
0.07
.resolve
0.06
Activations Density 0.057%