INDEX
Explanations
Examples and options
The neuron activates on wording that frames or links step-by-step instructional guidance (e.g. “follow along with me,” “trying the examples,” “in this tutorial”).
New Auto-Interp
Negative Logits
accidentally
-0.07
κε
-0.07
basis
-0.06
apollo
-0.06
kp
-0.06
_partition
-0.06
-0.06
ObservableCollection
-0.06
_deposit
-0.06
JButton
-0.06
POSITIVE LOGITS
505
0.07
lijke
0.06
acing
0.06
日に
0.06
_sr
0.06
-am
0.06
../../
0.06
.INFO
0.06
rar
0.06
trig
0.06
Activations Density 0.334%