INDEX
    Explanations

    The neuron activates on terms related to employee pay and benefits (e.g., “compensation,” “packages,” “benefits”).

    New Auto-Interp
    Negative Logits
     تج
    -0.07
    ETwitter
    -0.06
    osto
    -0.06
     Newtown
    -0.06
     towel
    -0.06
    958
    -0.06
    zzle
    -0.06
     ViewState
    -0.06
    ASON
    -0.06
     문서
    -0.06
    POSITIVE LOGITS
     features
    0.08
     Benefits
    0.07
    valor
    0.07
     feature
    0.07
     benefits
    0.07
    _dbg
    0.06
     WX
    0.06
     televizyon
    0.06
     Mobil
    0.06
     Percent
    0.06
    Act Density 0.005%

    No Known Activations