INDEX
    Explanations

    explicit references to dates and times—calendar months, years, timestamps, and other recency/real-time context markers within responses.

    This neuron detects mentions of the model metadata token (e.g. “model”) and associated numeric or timestamp values.

    New Auto-Interp
    Negative Logits
    هر
    0.31
     яр
    0.27
    ا
    0.25
     ২০১৫
    0.24
     फक्त
    0.24
    ارية
    0.23
    非常
    0.23
     über
    0.22
    经过
    0.22
     сразу
    0.21
    POSITIVE LOGITS
     vaccines
    0.29
     lockdowns
    0.27
     Biden
    0.27
     Vaccination
    0.26
     Cora
    0.26
     COR
    0.26
     OpenAI
    0.26
     coronavirus
    0.25
     ChatGPT
    0.25
    openai
    0.25
    Act Density 2.510%

    No Known Activations