INDEX
Explanations
phrases that convey a sense of identity and self-awareness
New Auto-Interp
Negative Logits
ilians
-0.18
ICO
-0.17
:name
-0.15
ãģ«è¦ĭ
-0.15
OMET
-0.14
chaft
-0.14
elenium
-0.14
umont
-0.14
awe
-0.14
indr
-0.14
POSITIVE LOGITS
someone
0.17
Someone
0.16
undergone
0.15
somebody
0.15
someone
0.15
oad
0.14
cul
0.14
ict
0.14
Uhr
0.14
Someone
0.14
Activations Density 0.145%