INDEX
Explanations
proper nouns, specifically names of authors, characters, or notable figures in literature
New Auto-Interp
Negative Logits
anki
-0.17
.GetType
-0.15
eya
-0.15
fikir
-0.14
est
-0.14
abus
-0.14
anic
-0.14
ona
-0.14
atego
-0.14
جÙħ
-0.14
POSITIVE LOGITS
ocker
0.17
shiv
0.15
.hwp
0.15
ucz
0.14
readcr
0.14
ysz
0.14
.scalablytyped
0.14
PasswordEncoder
0.14
.mixin
0.14
ubar
0.13
Activations Density 0.038%