INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     Пол
    -0.07
     retiring
    -0.07
    bben
    -0.07
     aired
    -0.07
    artment
    -0.07
    (Pos
    -0.07
    SEN
    -0.07
     died
    -0.06
     office
    -0.06
     اب
    -0.06
    POSITIVE LOGITS
     stimulus
    0.08
    IMAGE
    0.08
     stimuli
    0.07
     Instantiate
    0.06
    ups
    0.06
    stm
    0.06
    ulse
    0.06
    Markup
    0.06
    Hit
    0.06
     Tomato
    0.06
    Act Density 0.004%

    No Known Activations