INDEX
    Explanations

    This neuron responds to verbs and phrases describing military defeat or retreat in battle.

    Negative Logits
    io          -0.07
    (ind        -0.07
    [U          -0.06
                -0.06
    ΙΟ          -0.06
                -0.06
    ission      -0.06
    _LCD        -0.05
    (Un         -0.05
    SetFont     -0.05
    Positive Logits
     pym        0.07
    Cream       0.07
     vl         0.07
     حداقل      0.07
     prestige   0.06
    cken        0.06
    _PARTITION  0.06
     přesně     0.06
    .diag       0.06
     DAN        0.06
    Activation Density: 0.024%

    No Known Activations