INDEX
Explanations
The neuron selectively activates on the phrase “of course.”
New Auto-Interp
Negative Logits
ician
-0.06
musicians
-0.06
393
-0.06
_below
-0.06
EXEC
-0.06
↵
-0.06
گار
-0.06
.small
-0.06
Pam
-0.06
ді
-0.06
POSITIVE LOGITS
intimately
0.06
Cancellation
0.06
intros
0.06
ισμός
0.06
trieve
0.06
WebRequest
0.06
_ns
0.06
işleri
0.06
ятся
0.06
_reserve
0.06
Activations Density 0.041%