INDEX
Explanations
themes related to forgiveness and reconciliation
New Auto-Interp
Negative Logits
uess
-0.15
Opaque
-0.15
aset
-0.15
aret
-0.14
ARK
-0.14
ipp
-0.14
korun
-0.14
å®¶çļĦ
-0.14
_TP
-0.13
ãĥ³ãĥij
-0.13
POSITIVE LOGITS
Forg
0.27
forgiveness
0.24
reconciliation
0.22
forgiving
0.21
forgiven
0.20
reconc
0.19
abs
0.17
resolution
0.17
forgive
0.17
ptal
0.16
Activations Density 0.210%