INDEX
    Explanations

    This neuron responds to the initial question words in a user’s query—e.g. the “How can I” at the start of each question.

    New Auto-Interp
    Negative Logits
     Overrides
    -0.06
     ves
    -0.06
     inserting
    -0.06
    descriptor
    -0.06
    _exception
    -0.06
    834
    -0.06
    ैं।
    -0.06
    hes
    -0.05
    valid
    -0.05
    gett
    -0.05
    POSITIVE LOGITS
    0.06
    ้ว
    0.06
     breed
    0.06
     располож
    0.06
    __":↵
    0.06
     MOD
    0.06
    یدن
    0.06
     مي
    0.06
    یک
    0.06
     impedance
    0.06
    Act Density 0.043%

    No Known Activations