INDEX
Explanations
gradient
This neuron fires on occurrences of “gradient” (especially in “gradient descent”), i.e. it recognizes gradient-related terminology.
New Auto-Interp
Negative Logits
mobs
-0.08
voucher
-0.07
ubuntu
-0.07
umor
-0.06
-abs
-0.06
告
-0.06
165
-0.06
richText
-0.06
ulators
-0.06
766
-0.06
POSITIVE LOGITS
innovative
0.07
(Sprite
0.07
.HORIZONTAL
0.07
�
0.07
estamos
0.06
dejtings
0.06
habil
0.06
Coordinates
0.06
rou
0.06
ослож
0.06
Activations Density 0.002%