Explanations
Themes of exploitation and self-interest, particularly in financial and power contexts
Exploitation for personal gain
own, ambition, profit, greed
Negative Logits
LabelTagHelper    -0.53
sério             -0.47
                  -0.45
شهاد              -0.42
ModelState        -0.41
Innoc             -0.41
зион              -0.41
onError           -0.41
sentin            -0.40
wholes            -0.39
Positive Logits
selfish           1.46
selfish           1.20
selfishness       1.16
greed             1.15
ego               1.08
greedy            1.07
profit            1.06
ego               0.97
pecuniary         0.95
Ego               0.92
Activations Density: 0.387%