INDEX
Explanations
Punctuation and conjunctions
The main thing this neuron does is spot mentions of fishing poles (i.e. “fishing”) in the text.
New Auto-Interp
Negative Logits
000
-0.07
edicine
-0.07
045
-0.06
Severity
-0.06
(chart
-0.06
呼
-0.06
stems
-0.06
CEOs
-0.06
(Adapter
-0.06
.bo
-0.06
POSITIVE LOGITS
аліст
0.07
�
0.06
OLER
0.06
apat
0.06
сыл
0.06
relevant
0.06
loud
0.06
ेत
0.06
voř
0.06
разреш
0.06
Activations Density 0.323%