INDEX
Explanations
positive attributes related to ethics and morality
references to compassion and greed
New Auto-Interp
Negative Logits
Marble
-0.88
Surface
-0.80
Signal
-0.73
Completed
-0.72
Offline
-0.71
Rou
-0.70
Ja
-0.70
Brotherhood
-0.69
SIG
-0.68
mol
-0.67
POSITIVE LOGITS
compassion
2.65
greed
2.56
compassionate
1.96
greedy
1.59
selfish
1.47
altru
1.42
disdain
1.19
condesc
1.11
generosity
1.09
bene
1.09
Activations Density 0.022%