INDEX
    Explanations

    phrases related to rewards and rewards systems

    New Auto-Interp
    Negative Logits
    abad
    -0.88
    enium
    -0.85
    abases
    -0.81
     Ange
    -0.79
    lander
    -0.77
    obiles
    -0.77
    opic
    -0.75
     Osw
    -0.75
    head
    -0.74
     Alic
    -0.74
    POSITIVE LOGITS
     reward
    1.23
     rewarded
    1.20
     rewards
    1.16
     rewarding
    0.97
     reap
    0.90
    Reward
    0.90
     handsome
    0.87
     Reward
    0.86
     incentive
    0.86
     reinforcement
    0.85
    Act Density 14.218%

    No Known Activations