    NEGATIVE LOGITS
    (token missing)   -0.07
    ğ                 -0.07
    (token missing)   -0.07
    (token missing)   -0.07
    ’in               -0.06
    ivatel            -0.06
    spinner           -0.06
    }}">↵             -0.06
    цвет              -0.06
     interfering      -0.06
    POSITIVE LOGITS
     LABEL            0.08
     stimuli          0.07
    лон               0.07
     physiological    0.07
     Players          0.07
     toplumsal        0.07
     Julie            0.07
     biologist        0.06
     motivating       0.06
     asking           0.06
    Activation Density 0.010%

    No Known Activations