INDEX
    Explanations

    The neuron activates on conditional “if … then” constructs—i.e. it detects “if” statements paired with “then.”

    New Auto-Interp
    Negative Logits
    -0.08
    -0.07
    azard
    -0.07
    IJ
    -0.07
     mammals
    -0.06
     Labour
    -0.06
     своб
    -0.06
    -0.06
    ุ์
    -0.06
    (mappedBy
    -0.06
    POSITIVE LOGITS
    Titan
    0.08
    Then
    0.07
    으면
    0.07
     Leads
    0.07
     Thom
    0.07
     sonic
    0.07
    ben
    0.07
    .then
    0.07
    čen
    0.07
     Tooth
    0.06
    Act Density 0.012%

    No Known Activations