INDEX
Explanations
proper nouns and significant identifiers in the text
New Auto-Interp
Negative Logits
AKE
-0.16
erman
-0.15
emen
-0.15
efd
-0.14
äºĭ
-0.14
ermann
-0.14
vero
-0.14
INDOW
-0.14
ermen
-0.13
NJ
-0.13
POSITIVE LOGITS
iple
0.16
Dann
0.15
-fit
0.15
imoto
0.15
arest
0.15
ncoder
0.14
ntity
0.14
izu
0.13
cury
0.13
Birch
0.13
Activations Density 0.003%