INDEX
    Explanations

    The neuron specifically detects the phrase “cater to.”

    New Auto-Interp
    Negative Logits
    емых
    -0.07
    (Method
    -0.07
     valide
    -0.07
     STATIC
    -0.07
     puts
    -0.06
     обязатель
    -0.06
     Hou
    -0.06
     INS
    -0.06
     نیروی
    -0.06
     bumps
    -0.06
    POSITIVE LOGITS
     cater
    0.14
     catering
    0.11
     Cater
    0.11
    )reader
    0.07
     Attribution
    0.07
     serving
    0.06
    water
    0.06
     met
    0.06
     attribution
    0.06
    atever
    0.06
    Act Density 0.002%

    No Known Activations