INDEX
Explanations
personal experiences and perspectives related to change and adaptation
New Auto-Interp
Negative Logits
themselves
-0.40
us
-0.34
itself
-0.27
yourselves
-0.26
ours
-0.24
nous
-0.21
nám
-0.20
we
-0.19
me
-0.19
μαÏĤ
-0.18
POSITIVE LOGITS
myself
0.85
my
0.43
mijn
0.40
æĪijçļĦ
0.40
minha
0.36
meinem
0.33
jsem
0.33
бÑĥдÑĥ
0.33
meiner
0.30
meine
0.30
Activations Density 1.754%