Explanations
terms related to compassion
references to compassion and its related concepts
Negative Logits
Token     Logit
kj        -0.75
jong      -0.72
Rite      -0.70
*/(       -0.69
Bam       -0.68
wagen     -0.67
azeera    -0.65
andals    -0.65
bris      -0.64
ORN       -0.62
Positive Logits
Token           Logit
ately           0.89
compassionate   0.86
laureate        0.84
itably          0.81
towards         0.79
ãĥĨãĤ£          0.77
toward          0.76
compassion      0.72
Towards         0.72
giving          0.70
Activations Density 0.049%