    Explanations

    This neuron fires on the word “answer” and closely related Q&A vocabulary (e.g. “Answer,” “answering,” “Questions”); that is, it detects Q&A or answer segments.

    Negative Logits
    Token            Logit
    " strr"          -0.07
    "duk"            -0.07
    " ICC"           -0.06
    "mmc"            -0.06
    "Gap"            -0.06
    " envision"      -0.06
    "adoop"          -0.06
    ".setScene"      -0.06
    " ساخته"         -0.06
    "arı"            -0.06

    Positive Logits
    Token            Logit
    " answering"     0.08
    " answered"      0.08
    " reassure"      0.07
    " responding"    0.06
    ",but"           0.06
    "ितन"            0.06
    " наг"           0.06
    ",),"            0.06
    " Meaning"       0.06
    " sem"           0.06

    Activation density: 0.043%

    No Known Activations