INDEX
    Explanations

    The Shining

    This neuron activates on subword pieces of the film title “Shining,” effectively spotting mentions of “The Shining.”

    New Auto-Interp
    Negative Logits
     Nurs
    -0.06
    -0.06
     alınması
    -0.06
     masks
    -0.06
    -0.06
     pytest
    -0.06
    ewidth
    -0.06
     adulti
    -0.06
    주시
    -0.06
    (passport
    -0.06
    POSITIVE LOGITS
     (!((
    0.07
     discrepancies
    0.06
     energy
    0.06
     sitcom
    0.06
     outnumber
    0.06
    reative
    0.06
     сбор
    0.06
     smoothly
    0.06
     through
    0.06
     обы
    0.06
    Act Density 0.002%

    No Known Activations