INDEX
Explanations
The neuron preferentially activates on verbs in their “-ing” (present-participle/gerund) form.
New Auto-Interp
Negative Logits
uco
-0.08
'",
-0.07
ullet
-0.06
_FM
-0.06
_pkg
-0.06
]
-0.06
eah
-0.06
bx
-0.06
ccount
-0.06
.extract
-0.06
POSITIVE LOGITS
Classes
0.07
Quite
0.07
यह
0.06
حجم
0.06
거야
0.06
北京
0.06
Pending
0.06
-images
0.06
Naples
0.06
)(↵
0.06
Activations Density 0.070%