INDEX
Explanations
themes of altruism and compassion
New Auto-Interp
Negative Logits
yas
-0.15
лам
-0.14
(*)(
-0.14
indsight
-0.14
educt
-0.14
sed
-0.14
olum
-0.14
okud
-0.14
sed
-0.13
URATION
-0.13
POSITIVE LOGITS
generosity
0.20
giving
0.19
altru
0.18
caring
0.18
Giving
0.18
generous
0.18
gestures
0.17
service
0.17
recipro
0.17
philanth
0.16
Activations Density 0.223%