Explanations
The neuron activates whenever the word “output” appears, in any context or form.
Negative Logits
жест           -0.07
Regional       -0.06
fingerprint    -0.06
use            -0.06
-job           -0.06
posium         -0.06
수도           -0.06
Icon           -0.06
donors         -0.06
_DU            -0.06
Positive Logits
Εκ             0.08
득             0.07
khoảng         0.06
vendor         0.06
profes         0.06
startY         0.06
الإ            0.06
documentation  0.06
बय             0.06
)reader        0.06
Activations Density: 0.027%