INDEX
Explanations
words and phrases related to personal growth and support
Negative Logits
ourselves     -0.33
(us           -0.23
ours          -0.22
Us            -0.19
us            -0.19
ours          -0.19
yourselves    -0.18
เอง           -0.18
Us            -0.18
usk           -0.17
Positive Logits
me            0.69
me            0.49
-me           0.43
我            0.40
ME            0.40
_me           0.36
меня          0.36
met           0.35
.me           0.35
mee           0.35
Activations Density 0.183%
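
Non-ASCII tokens in listings like this one are often stored in the tokenizer's byte-level form, where every UTF-8 byte is shown as a single printable character. The sketch below assumes the model uses the GPT-2-style byte-to-character table (an assumption; the source does not name the tokenizer) and shows how a stored token string maps back to readable text. The function names encode_token and decode_token are hypothetical helpers, not part of any library.

```python
def bytes_to_unicode() -> dict[int, str]:
    """GPT-2-style reversible map from each byte value (0-255) to a printable character."""
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is shifted to a code point at or above 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


BYTE_TO_CHAR = bytes_to_unicode()
CHAR_TO_BYTE = {c: b for b, c in BYTE_TO_CHAR.items()}


def encode_token(text: str) -> str:
    """Readable text -> byte-level form as stored in the vocabulary."""
    return "".join(BYTE_TO_CHAR[b] for b in text.encode("utf-8"))


def decode_token(token: str) -> str:
    """Byte-level form -> readable text. Raises KeyError on characters outside the table."""
    return bytes(CHAR_TO_BYTE[ch] for ch in token).decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Round-trip a few of the readable tokens from the Positive Logits list.
    for text in ["me", " me", "我", "меня"]:
        stored = encode_token(text)
        print(repr(text), "->", repr(stored), "->", repr(decode_token(stored)))
```

Under this mapping a leading space is stored as a separate visible character, so rows that look identical in the lists above (for example the two "me" rows) may be distinct vocabulary entries that differ only in leading whitespace.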