INDEX
    Explanations

    question marks

    This neuron responds to special sequence‐boundary tokens (e.g. end‐of‐turn or end‐of‐text markers).

    New Auto-Interp
    Negative Logits
     عشق
    -0.06
    (sys
    -0.06
    计划
    -0.06
    Bush
    -0.06
     Zac
    -0.06
     Mercy
    -0.06
    -0.06
     Asians
    -0.06
     下午
    -0.06
     همیشه
    -0.06
    POSITIVE LOGITS
    .espresso
    0.07
     vide
    0.06
     optimizations
    0.06
    0.06
     unknow
    0.06
     totaling
    0.06
    웨어
    0.06
    grave
    0.06
     sca
    0.06
    .he
    0.06
    Act Density 0.050%

    No Known Activations