    Explanations

    The neuron responds to tokens that mark the start of a new broadcast segment or speaker turn, particularly the capitalized words and short phrases used as show intros or host cues.

    Negative Logits
     thực
    -0.07
    (download
    -0.07
     Ripple
    -0.07
    &q
    -0.07
     pane
    -0.07
    .pr
    -0.06
    Appointment
    -0.06
    	bt
    -0.06
    .getItems
    -0.06
     Venue
    -0.06
    Positive Logits
     підвищ
    0.07
     зобов
    0.07
    0.07
    esy
    0.06
     спіл
    0.06
    .MULT
    0.06
    .proxy
    0.06
    existence
    0.06
    】↵↵
    0.06
    (FLAGS
    0.06
    Act Density 0.014%

    No Known Activations