INDEX
Explanations
words related to books, authors, and literary activities
New Auto-Interp
Negative Logits
Bots
-0.70
ilitary
-0.67
00200000
-0.66
Yin
-0.66
twitch
-0.65
hod
-0.63
xon
-0.63
asonic
-0.62
Santiago
-0.59
ptions
-0.58
POSITIVE LOGITS
stores
1.43
seller
1.27
marks
1.17
shop
1.17
worm
1.12
worms
1.06
marked
1.05
sell
1.05
cases
0.99
manuscript
0.98
Activations Density 0.633%