INDEX
    Explanations

    This neuron detects words referring to conflicts or collisions (e.g., naming, routing, or resource conflicts).

    New Auto-Interp
    Negative Logits
    -0.07
     trained
    -0.07
    职业
    -0.07
    _BREAK
    -0.06
     Graduate
    -0.06
    acent
    -0.06
    Dallas
    -0.06
    panies
    -0.06
    测试
    -0.06
     Cunningham
    -0.06
    POSITIVE LOGITS
     screams
    0.07
    ुँ
    0.06
    sj
    0.06
    rho
    0.06
    ersiz
    0.06
    >(*
    0.06
     Terraria
    0.06
    mayı
    0.06
    NEXT
    0.06
     Tomas
    0.06
    Act Density 0.009%

    No Known Activations