INDEX
Explanations
negative aspects
The neuron activates on language pointing out defects, disadvantages, or shortcomings—i.e. negative evaluations of prior work.
New Auto-Interp
Negative Logits
itm
-0.08
ache
-0.07
lection
-0.07
_Login
-0.07
flower
-0.06
Hồ
-0.06
.login
-0.06
Winning
-0.06
methods
-0.06
δώ
-0.06
POSITIVE LOGITS
хими
0.07
pyx
0.07
intermedi
0.06
OutlineInputBorder
0.06
ignore
0.06
582
0.06
glyphicon
0.06
FirebaseDatabase
0.06
IService
0.06
$#
0.06
Activations Density 0.026%