INDEX
    Explanations

    Conditional statements

    phrases starting with "In" at the beginning of a document.

    This neuron activates on the leading phrase “In the” at the start of a sentence.

    New Auto-Interp
    Negative Logits
     justification
    -0.06
    incinn
    -0.06
     ignorant
    -0.06
    Speed
    -0.06
    brook
    -0.05
     POW
    -0.05
     legalization
    -0.05
    フォ
    -0.05
     literature
    -0.05
    WG
    -0.05
    POSITIVE LOGITS
     baktı
    0.07
     incluso
    0.07
    анії
    0.07
    bra
    0.07
     olmadan
    0.07
     cigaret
    0.06
     viability
    0.06
    eson
    0.06
    üncü
    0.06
    .des
    0.06
    Act Density 0.034%

    No Known Activations