INDEX
    Explanations

    terms related to rewards and punishments

    concepts related to rewards, recognition, and outcomes of actions.

    New Auto-Interp
    Negative Logits
    .*")]
    -0.65
     препратки
    -0.62
    WebRequest
    -0.62
    Enlaces
    -0.58
     Larkin
    -0.57
     surla
    -0.57
    ьа
    -0.56
     Fle
    -0.56
     CGRect
    -0.55
    Vat
    -0.55
    POSITIVE LOGITS
     reward
    1.73
     rewards
    1.59
     Reward
    1.52
    reward
    1.44
     Rewards
    1.42
    Reward
    1.39
     rewarded
    1.37
    Rewards
    1.26
     rewarding
    1.23
     punish
    1.19
    Act Density 0.144%

    No Known Activations