INDEX
    Explanations

    The neuron activates on words denoting a small, approximate quantity—most notably the word “few.”

    New Auto-Interp
    Negative Logits
     perennial
    -0.07
     Chronicle
    -0.07
     weblog
    -0.06
     Lok
    -0.06
    092
    -0.06
    .nombre
    -0.06
    -0.06
    gregator
    -0.06
    QP
    -0.06
    oxy
    -0.06
    POSITIVE LOGITS
     a
    0.09
    -the
    0.08
    -a
    0.07
     very
    0.07
     an
    0.07
     les
    0.07
     the
    0.07
    	an
    0.06
    IL
    0.06
     mor
    0.06
    Act Density 0.032%

    No Known Activations