INDEX
    Explanations

    information related to rewards and financial incentives

    New Auto-Interp
    Negative Logits
     autorytatywna
    -0.88
    TagMode
    -0.86
     الحره
    -0.81
     שוליים
    -0.75
    featureID
    -0.74
    ChildScrollView
    -0.74
     للمعارف
    -0.70
    Controllo
    -0.69
    testens
    -0.69
    hoeddwyd
    -0.68
    POSITIVE LOGITS
     reward
    1.08
     rewards
    0.97
     incentives
    0.90
    reward
    0.89
    Reward
    0.88
     Rewards
    0.87
     Reward
    0.87
     incentiv
    0.85
     incentive
    0.81
     prize
    0.81
    Act Density 0.219%

    No Known Activations