INDEX
    Explanations

    specificity

    The neuron flags occurrences of the term “specificity.”

    New Auto-Interp
    Negative Logits
     plastics
    -0.06
     Randolph
    -0.06
    vekili
    -0.06
    ucket
    -0.06
     cache
    -0.06
     trusted
    -0.06
     comb
    -0.06
    ारक
    -0.06
    кт
    -0.06
    ctime
    -0.06
    POSITIVE LOGITS
    checkout
    0.07
    .spec
    0.06
    0.06
     via
    0.06
     premiere
    0.06
    режд
    0.06
     Rowling
    0.06
    (笑
    0.06
     Portable
    0.06
    0.06
    Act Density 0.015%

    No Known Activations