INDEX
    Explanations

    This neuron activates on words related to personalized or targeted marketing (e.g., personal/personalization, personalized, targeted, targeting, segmentation).

    New Auto-Interp
    Negative Logits
    daemon
    -0.07
    rint
    -0.06
    -pro
    -0.06
     WAN
    -0.06
    lements
    -0.06
    .hl
    -0.06
    -0.06
     near
    -0.06
     wake
    -0.06
     b
    -0.06
    POSITIVE LOGITS
    0.07
     luxurious
    0.07
     швид
    0.06
     aldığı
    0.06
    .stats
    0.06
    0.06
    0.06
     đúng
    0.06
    187
    0.06
    0.06
    Act Density 0.031%

    No Known Activations