INDEX
Explanations
terms related to rewards or benefits
references to economic incentives
New Auto-Interp
Negative Logits
rooms
-0.92
room
-0.78
gaard
-0.76
lings
-0.75
erial
-0.75
lain
-0.75
algia
-0.74
mbuds
-0.73
bane
-0.73
uck
-0.72
POSITIVE LOGITS
incentives
1.07
incentive
1.07
incentiv
1.04
incent
0.97
rewarded
0.87
Reviewer
0.86
schemes
0.86
scheme
0.83
motivate
0.83
compulsion
0.81
Activations Density 0.025%