INDEX
Explanations
mentions of the word "book."
mentions of books
New Auto-Interp
Negative Logits
ilitary
-0.75
Bots
-0.72
sembly
-0.68
xon
-0.67
distant
-0.65
twitch
-0.64
cffff
-0.64
Yin
-0.64
Lumpur
-0.64
VIDEOS
-0.62
POSITIVE LOGITS
stores
1.49
seller
1.27
marks
1.27
shop
1.14
marked
1.11
cases
1.09
worms
1.08
worm
1.05
book
1.03
keeping
1.02
Activations Density 0.038%