INDEX
Explanations
The neuron never activates (it’s effectively “dead” and does not detect any pattern).
New Auto-Interp
Negative Logits
tolower
-0.06
TOOLS
-0.06
Unt
-0.06
gun
-0.06
Ste
-0.06
favoured
-0.06
Servers
-0.06
_pid
-0.06
_ste
-0.06
irony
-0.06
POSITIVE LOGITS
_similarity
0.07
防
0.07
terra
0.07
481
0.07
ото
0.06
Tina
0.06
<?↵
0.06
──
0.06
ilip
0.06
Там
0.06
Activations Density 0.001%