INDEX
Explanations
phrases pertaining to book reviews and ratings
New Auto-Interp
Negative Logits
Hed
-0.16
hek
-0.16
èµĦ
-0.16
pline
-0.16
бÑĥдÑĮ
-0.15
rung
-0.15
ming
-0.14
ãĥ«ãĥķ
-0.14
inas
-0.14
ILED
-0.14
POSITIVE LOGITS
Quotes
0.19
antine
0.17
Trivia
0.17
enha
0.15
reads
0.15
.library
0.15
audible
0.14
åĢŁ
0.14
rating
0.14
shelves
0.14
Activations Density 0.009%