Explanations
words related to compassion, empathy, and leniency
concepts related to mercy and compassion
Negative Logits
yi        -0.78
orn       -0.77
ORN       -0.76
andals    -0.74
kj        -0.72
add       -0.66
need      -0.66
ouf       -0.66
ossier    -0.65
gars      -0.64
Positive Logits
mercy         1.14
pard          0.90
auctions      0.80
forgiveness   0.76
saf           0.73
efully        0.72
forgive       0.70
pardon        0.69
relief        0.68
giveaway      0.68
Activations Density: 0.011%