INDEX
    Explanations

    Thoughts and feelings

    The neuron fires on key content words that denote someone’s focus or objective—nouns like “mind,” “goal,” “thing,” or “strength” that signal what’s central in the discourse.

    Negative Logits
    Token          Logit
    "/login"       -0.07
    "itou"         -0.07
    "etus"         -0.07
    " PKK"         -0.07
    " arrests"     -0.07
    "请输入"        -0.07
    " lyon"        -0.07
    "greens"       -0.06
    " Sergei"      -0.06
    " Quint"       -0.06

    Positive Logits
    Token          Logit
    "__(↵"         0.07
    " rep"         0.06
    "_READ"        0.06
    " sağlar"      0.06
    "establish"    0.06
    " wherein"     0.06
    "-window"      0.06
    " Objective"   0.06
    " lọc"         0.06
    "์โ"           0.06
    Act Density 0.023%

    No Known Activations