Explanations
This neuron mainly detects specific dates written with month names.
It responds to mentions of the token "ember" and its variants, indicating a focus on the month of December.
Negative Logits
Token        Logit
SPONSORED    -0.75
Strauss      -0.74
Collider     -0.69
Kissinger    -0.65
egal         -0.64
Wonderful    -0.63
overs        -0.62
Medicare     -0.59
omen         -0.59
Pathfinder   -0.58
Positive Logits
Token        Logit
mented       0.98
mentation    0.93
ember        0.93
getic        0.92
gence        0.92
gdala        0.91
gments       0.90
nesday       0.89
ment         0.86
utive        0.84
Activations Density: 0.009%
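
As a rough check on the explanations, the sketch below tests which of the listed positive-logit tokens are suffixes of English month or weekday names. The helper `calendar_completions` and the suffix-matching heuristic are illustrative assumptions, not part of the original analysis; only the token list is taken from this page.

```python
# Positive-logit tokens copied from the list on this page.
positive_tokens = ["mented", "mentation", "ember", "getic", "gence",
                   "gdala", "gments", "nesday", "ment", "utive"]

# English month and weekday names, hardcoded to avoid locale dependence.
calendar_words = [
    "January", "February", "March", "April", "May", "June", "July",
    "August", "September", "October", "November", "December",
    "Monday", "Tuesday", "Wednesday", "Thursday", "Friday",
    "Saturday", "Sunday",
]

def calendar_completions(token):
    """Hypothetical helper: calendar words that end with the given token."""
    return [w for w in calendar_words if w.lower().endswith(token.lower())]

# Keep only tokens that complete at least one month or weekday name.
matches = {}
for token in positive_tokens:
    words = calendar_completions(token)
    if words:
        matches[token] = words

for token, words in matches.items():
    print(token, "->", ", ".join(words))
```

Under this heuristic, "ember" is a suffix of September, November, and December, and "nesday" is a suffix of Wednesday. The remaining positive-logit tokens do not complete any month or weekday name, so the date-related reading may cover only part of what this neuron promotes.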