INDEX
    Explanations

    programming/computer science

    The neuron spikes on mentions of content‐policy override instructions—words like “normal,” “content,” “policies,” etc., indicating a discussion of replacing or ignoring content policies.

    New Auto-Interp
    Negative Logits
    >>>>>>>
    -0.06
     Yo
    -0.06
     této
    -0.06
    ,*
    -0.06
     Dix
    -0.06
    Doc
    -0.06
     quiero
    -0.06
    ,%
    -0.06
    _door
    -0.05
     Krank
    -0.05
    POSITIVE LOGITS
    ificate
    0.07
    ElementsByTagName
    0.07
     frankly
    0.07
     existential
    0.07
    onical
    0.06
     JTextField
    0.06
     thermometer
    0.06
    '))↵↵↵
    0.06
     otur
    0.06
    Interested
    0.06
    Act Density 0.000%

    No Known Activations