INDEX
Explanations
names and proper nouns related to people and places
New Auto-Interp
Negative Logits
ka
-0.30
li
-0.28
nya
-0.27
so
-0.24
la
-0.24
me
-0.23
ning
-0.23
ma
-0.23
ne
-0.23
che
-0.22
POSITIVE LOGITS
Ùĭ
0.26
ught
0.24
’nın
0.22
issance
0.22
ughter
0.21
'nın
0.21
election
0.20
irement
0.20
eus
0.20
ughty
0.19
Activations Density 1.018%