INDEX
Explanations
expressions of passion and enjoyment in work
Negative Logits
ocy          -0.19
AGMA         -0.15
éļĨ          -0.14
ucht         -0.13
.Business    -0.13
iller        -0.13
åĮº          -0.13
ido          -0.13
mime         -0.13
orsch        -0.13
Positive Logits
satisfaction    0.27
Reward          0.27
rewarding       0.25
Reward          0.25
rewards         0.24
Satisfaction    0.24
reward          0.23
reward          0.23
challenging     0.23
challenge       0.23
Activations Density: 0.212%