Explanations
proper nouns, particularly names
Negative Logits
enko    -0.18
iÄĻ     -0.17
anca    -0.17
gart    -0.17
íĴį     -0.16
uju     -0.15
an      -0.15
u       -0.15
iyah    -0.15
est     -0.15
Positive Logits
nowled      0.28
ismet       0.24
adem        0.23
ademic      0.23
erman       0.22
nowledge    0.22
robat       0.20
zept        0.20
cent        0.20
47          0.19
Activations Density 0.011%