INDEX
Explanations
Similarly
The neuron activates on sentence-initial comparative/discourse adverbs (e.g. “Similarly”) that introduce a parallel or analogous point.
New Auto-Interp
Negative Logits
_area
-0.07
\Template
-0.06
πρά
-0.06
Oktober
-0.06
FB
-0.06
oz
-0.06
пр
-0.06
�
-0.06
_msg
-0.06
ortadan
-0.06
POSITIVE LOGITS
Similarly
0.09
Similarly
0.07
similarly
0.07
thrift
0.07
Willie
0.06
Epoch
0.06
ené
0.06
ROP
0.06
EEP
0.06
awe
0.06
Activations Density 0.009%