INDEX
Explanations
sentences related to accepting oneself and others despite external judgment
New Auto-Interp
Negative Logits
raltar
-0.76
è¦ļéĨĴ
-0.74
xtap
-0.61
?????-?????-
-0.60
encount
-0.60
è£ħ
-0.59
ccording
-0.59
rontal
-0.59
icipated
-0.58
earthqu
-0.57
POSITIVE LOGITS
theirs
1.46
him
1.11
hers
1.08
us
0.89
themselves
0.89
me
0.88
yours
0.87
their
0.87
them
0.86
their
0.85
Activations Density 2.966%