Explanations
The neuron fires on tokens in the “forgotten to” construction—that is, it detects when the text says someone forgot to do something.
Negative Logits
Minuten    -0.07
orem       -0.06
olu        -0.06
ฟอร        -0.06
[next      -0.06
อดภ        -0.06
FIR        -0.06
лет        -0.06
Tot        -0.06
�          -0.05
Positive Logits
погод            0.07
Highlights       0.07
()].             0.07
:.               0.07
)!               0.07
.Out             0.06
forgot           0.06
TS               0.06
MemoryWarning    0.06
!:               0.06
Activations Density: 0.017%
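
The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of token positions on which the neuron's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens on which the neuron is active.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 17 active tokens out of 100,000 sampled tokens.
acts = [0.0] * 99_983 + [1.5] * 17
density = activation_density(acts)
print(f"{density:.3%}")
```

Under that assumption, a density of 0.017% would correspond to roughly 17 active tokens per 100,000, which fits a neuron tied to a single narrow construction.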