INDEX
    Explanations

    The neuron activates on occurrences of the word “complete,” especially when it appears in titles or headings.

    New Auto-Interp
    Negative Logits
    xford
    -0.07
    .Pixel
    -0.07
    igy
    -0.07
     Wor
    -0.07
    .XR
    -0.06
     hy
    -0.06
     Hük
    -0.06
    approval
    -0.06
    /OR
    -0.06
     آز
    -0.06
    POSITIVE LOGITS
     complete
    0.14
     Complete
    0.11
    Complete
    0.08
    complete
    0.08
     Claire
    0.08
     completeness
    0.08
    完整
    0.08
    .Complete
    0.07
    -complete
    0.07
    Comple
    0.07
    Act Density 0.011%

    No Known Activations