INDEX
Explanations
themes of love, charity, and compassion in discussions about moral values
associated with positive emotions
love and forgiveness
Negative Logits
flops            -0.50
culada           -0.46
IntoConstraints  -0.46
restlessness     -0.46
存于互联网档案馆  -0.46
iedział          -0.46
siphon           -0.44
teenth           -0.44
stealth          -0.44
fidget           -0.44
Positive Logits
kindness     0.90
Kindness     0.88
compassion   0.79
Compassion   0.79
Compassion   0.79
Forgiveness  0.75
peace        0.74
hate         0.73
humanity     0.73
kinder       0.71
Activations Density 0.207%