INDEX
    Explanations

    mentions of dates or date-related phrases (e.g., years, months, "current date", "knowledge cutoff").

    The neuron is detecting numeric tokens and punctuation used in dates (e.g. year, month, day numbers and their separators).

    New Auto-Interp
    Negative Logits
    4
    -0.09
    2
    -0.09
     Hod
    -0.09
     Landing
    -0.09
    0
    -0.09
     Lor
    -0.08
    92
    -0.08
    255
    -0.08
    3
    -0.08
     Morr
    -0.08
    POSITIVE LOGITS
    âĸłâĸł
    0.11
    пÑĢимеÑĢ
    0.09
    toa
    0.09
     Schultz
    0.09
    .UserInfo
    0.09
    ccess
    0.09
    ellers
    0.08
     Cassidy
    0.08
    ynet
    0.08
     Chat
    0.08
    Act Density 0.012%

    No Known Activations