Explanations
words related to compassion and kindness
references to the concept of compassion
Negative Logits
*/(      -0.76
jong     -0.70
paces    -0.67
Rite     -0.64
Bam      -0.64
kj       -0.61
Amend    -0.60
seq      -0.58
hof      -0.58
Reich    -0.58
Positive Logits
ately          1.07
iously         0.86
laureate       0.80
rehend         0.80
acy            0.79
ISM            0.73
onest          0.72
assion         0.72
compassionate  0.71
fulness        0.71
Activations Density 0.010%
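
The page does not define the density figure. One common reading, assumed here, is the fraction of token positions at which the feature's activation is above zero. The sketch below illustrates that reading on made-up data; the function name `activation_density`, the threshold, and the toy activations are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, exactly one of which fires.
toy_activations = [0.0] * 9999 + [3.2]
density = activation_density(toy_activations)
print(f"{density:.3%}")  # prints "0.010%"
```

Under this reading, a density of 0.010% would mean the feature fires on roughly one token in every ten thousand.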