INDEX
    Explanations

    most common

    This neuron detects superlative-frequency phrases indicating the most common or most frequent items (e.g. “most common,” “most frequent,” “commonest,” “most prevalent”).

    New Auto-Interp
    Negative Logits
     sıras
    -0.07
    Negative
    -0.07
     humili
    -0.06
    анг
    -0.06
     bruises
    -0.06
    -0.06
     вним
    -0.06
    OG
    -0.06
    umb
    -0.06
    كل
    -0.06
    POSITIVE LOGITS
    0.08
    [I
    0.07
    /library
    0.07
     []:↵
    0.07
    -cn
    0.06
    0.06
     ein
    0.06
     Afro
    0.06
    (I
    0.06
    0.06
    Act Density 0.024%

    No Known Activations