INDEX
Explanations
phrases that promote personal and communal growth through acceptance and awareness
New Auto-Interp
Negative Logits
hed
-0.16
aint
-0.14
<<=
-0.13
thù
-0.13
ARGE
-0.13
biz
-0.13
[color
-0.13
umu
-0.13
nackte
-0.13
cpy
-0.13
POSITIVE LOGITS
how
0.20
what
0.20
Others
0.16
and
0.16
yourself
0.16
reality
0.16
where
0.15
oneself
0.15
ourselves
0.15
Hunger
0.15
Activations Density 0.285%