INDEX
    Explanations

    The neuron selectively activates on the verb “manipulate” (and its inflected forms like “manipulating”), flagging occurrences of that word.

    New Auto-Interp
    Negative Logits
     професій
    -0.06
     cortical
    -0.06
    edium
    -0.06
    ortality
    -0.06
    lications
    -0.06
     focal
    -0.06
     Scott
    -0.06
     horrific
    -0.06
     ecosystems
    -0.06
    licted
    -0.06
    POSITIVE LOGITS
     manipulation
    0.12
     manip
    0.12
     Manip
    0.11
     manipulated
    0.10
    Manip
    0.09
     manipulating
    0.09
     manipulate
    0.08
     [...]
    0.08
     Manning
    0.07
    operate
    0.07
    Act Density 0.010%

    No Known Activations