Explanations
This neuron never activates (all its activation values are zero), so it doesn’t detect or respond to any pattern.
Negative Logits
københavn        -0.07
_"+              -0.06
"%(              -0.06
attack           -0.06
Bosnia           -0.06
Mes              -0.06
themselves       -0.06
deniz            -0.06
Adidas           -0.06
рави             -0.06
Positive Logits
advertisement    0.07
:checked         0.06
tim              0.06
takže            0.06
.var             0.06
superstar        0.06
درمان            0.06
different        0.06
(messages        0.06
зда              0.06
Activations Density 0.009%