INDEX
Explanations
mentions of books with different contexts, such as library-related discussions, reading recommendations, and book purchases
New Auto-Interp
Negative Logits
00200000
-0.72
Yin
-0.70
xon
-0.69
ntil
-0.68
Vital
-0.66
abwe
-0.65
Bots
-0.64
yip
-0.64
ilitary
-0.64
alty
-0.62
POSITIVE LOGITS
stores
1.37
hel
1.24
shop
1.15
seller
1.07
books
1.06
worms
1.05
marks
1.00
worm
1.00
sell
0.99
reading
0.94
Activations Density 0.047%