INDEX
Explanations
terms related to financial motivations and incentives
New Auto-Interp
Negative Logits
rooms
-1.00
bane
-0.83
room
-0.83
lain
-0.80
erial
-0.80
mbuds
-0.79
algia
-0.79
miah
-0.75
thing
-0.74
ugu
-0.73
POSITIVE LOGITS
incentive
1.08
incentives
1.06
incentiv
1.00
Reviewer
0.98
schemes
0.91
incent
0.90
scheme
0.88
rewarded
0.84
compulsion
0.84
recipients
0.82
Activations Density 0.008%