INDEX
Explanations
concepts related to spiritual practice and individual identity
New Auto-Interp
Negative Logits
iens
-0.15
ibri
-0.14
ozo
-0.14
aney
-0.14
uibModal
-0.14
amaz
-0.14
opc
-0.14
moż
-0.13
.ng
-0.13
udder
-0.13
POSITIVE LOGITS
oneself
0.73
ones
0.53
someone
0.43
ä¸Ģ个人
0.38
Ones
0.38
your
0.37
one
0.36
ones
0.35
someone
0.34
somebody
0.32
Activations Density 0.956%