INDEX
Explanations
mentions of media events or cultural references
New Auto-Interp
Negative Logits
olia
-0.17
olie
-0.15
kan
-0.15
à¸²à¸ł
-0.14
å¡
-0.14
еж
-0.14
ucas
-0.13
ç¼
-0.13
å¾ĭ
-0.13
ardi
-0.13
POSITIVE LOGITS
bookstore
0.41
Books
0.38
Book
0.38
book
0.37
books
0.35
Books
0.34
bibli
0.32
Book
0.32
BOOK
0.31
книж
0.31
Activations Density 0.115%