INDEX
Explanations
phrases related to self-reflection and personal growth
phrases related to personal identity and self-reflection
New Auto-Interp
Negative Logits
Ĥª
-0.64
adelphia
-0.60
ĸļ
-0.59
}}}
-0.57
="#
-0.55
ItemTracker
-0.55
_-_
-0.55
ãħĭãħĭ
-0.55
elvet
-0.54
ãĤ¦ãĤ¹
-0.54
POSITIVE LOGITS
my
1.91
I
1.89
myself
1.65
me
1.41
I
1.38
mine
1.34
MY
1.33
My
1.31
my
1.29
My
1.25
Activations Density 1.643%