INDEX
Explanations
words related to selfishness, particularly in personal relationships and decision-making contexts
New Auto-Interp
Negative Logits
Downloadha
-0.80
ICAN
-0.75
ournals
-0.71
enegger
-0.69
oval
-0.69
stable
-0.68
ahead
-0.68
accompanied
-0.67
interrupted
-0.67
aster
-0.67
POSITIVE LOGITS
greed
1.07
motives
0.91
narciss
0.90
jealousy
0.89
greedy
0.87
jealous
0.87
arrogance
0.87
hypocrisy
0.87
hypoc
0.87
narcissistic
0.86
Activations Density 0.079%