INDEX
    Explanations

    sports wins

    The neuron activates on numeric tokens (especially years and other multi-digit numbers).

    New Auto-Interp
    Negative Logits
    .windows
    -0.07
     temin
    -0.06
     anthology
    -0.06
    GEN
    -0.06
    .support
    -0.06
    ')['
    -0.06
    包含
    -0.06
    AFF
    -0.06
    .Ex
    -0.06
     potassium
    -0.06
    POSITIVE LOGITS
    [args
    0.07
    мін
    0.07
     sentiments
    0.07
     действия
    0.06
    !
    0.06
    stack
    0.06
     liberties
    0.06
    0.06
    mouseover
    0.06
     Jihad
    0.06
    Act Density 0.024%

    No Known Activations