INDEX
Explanations
references to authors, books, and literary works
New Auto-Interp
Negative Logits
olia
-0.17
ì§ĵ
-0.15
ORITY
-0.15
_Style
-0.15
å¾ĭ
-0.14
ovice
-0.14
ìĬ¹
-0.14
Property
-0.14
Property
-0.14
kan
-0.14
POSITIVE LOGITS
bookstore
0.32
Books
0.28
Book
0.28
book
0.27
books
0.27
independent
0.25
bibli
0.25
independents
0.25
Books
0.25
Independent
0.23
Activations Density 0.065%