INDEX
Explanations
references to books and reading-related activities
New Auto-Interp
Negative Logits
ilitary
-0.70
Bots
-0.70
xon
-0.65
asonic
-0.64
00200000
-0.63
uppet
-0.61
hod
-0.60
Yin
-0.60
twitch
-0.60
nel
-0.58
POSITIVE LOGITS
stores
1.35
seller
1.21
shop
1.13
manuscript
1.12
marks
1.01
books
1.00
publisher
0.99
worm
0.98
publishers
0.98
books
0.97
Activations Density 2.003%