INDEX
    Explanations

    The neuron activates on category lines specifying a cause of death (e.g. “Deaths from …”).

    New Auto-Interp
    Negative Logits
     positives
    -0.07
     Level
    -0.07
    IVING
    -0.07
     delta
    -0.07
    ッチ
    -0.07
     started
    -0.06
     sparkle
    -0.06
    simd
    -0.06
     Street
    -0.06
    Actions
    -0.06
    POSITIVE LOGITS
     보호
    0.06
     enclave
    0.06
    searchModel
    0.06
     béné
    0.06
    Dst
    0.05
    PressEvent
    0.05
    orestation
    0.05
    hay
    0.05
     площад
    0.05
    Coupon
    0.05
    Act Density 0.003%

    No Known Activations