INDEX
    Explanations

    conversation snippets

    situations involving relationship dynamics and emotional connections.

    This neuron detects speaker-identifying tokens or turn labels (e.g., “Z:”, “O:”, names) marking who’s speaking.

    New Auto-Interp
    Negative Logits
    CEPTION
    -0.07
     ise
    -0.06
     Px
    -0.06
    strap
    -0.06
     Ax
    -0.06
     okul
    -0.06
    Ax
    -0.06
    	System
    -0.06
     decoder
    -0.06
     footwear
    -0.06
    POSITIVE LOGITS
    іка
    0.07
    自分の
    0.07
     ấn
    0.06
     isOpen
    0.06
    віт
    0.06
    _CMD
    0.06
     سال
    0.06
    建议
    0.06
    /Create
    0.06
     bruk
    0.06
    Act Density 0.064%

    No Known Activations