INDEX
Explanations
dates and years
mentions of dates or date-related phrases (e.g., years, months, "current date", "knowledge cutoff").
The neuron is detecting numeric tokens and punctuation used in dates (e.g. year, month, day numbers and their separators).
New Auto-Interp
Negative Logits
4
-0.09
2
-0.09
Hod
-0.09
Landing
-0.09
0
-0.09
Lor
-0.08
92
-0.08
255
-0.08
3
-0.08
Morr
-0.08
POSITIVE LOGITS
âĸłâĸł
0.11
пÑĢимеÑĢ
0.09
toa
0.09
Schultz
0.09
.UserInfo
0.09
ccess
0.09
ellers
0.08
Cassidy
0.08
ynet
0.08
Chat
0.08
Activations Density 0.012%