INDEX
Explanations
themes of powerlessness and choice in literature
New Auto-Interp
Negative Logits
books
-0.17
Books
-0.17
fran
-0.16
rama
-0.16
book
-0.16
-books
-0.16
rios
-0.15
apid
-0.14
helf
-0.14
zew
-0.14
POSITIVE LOGITS
short
0.26
çŁŃ
0.22
essay
0.22
-short
0.22
short
0.21
shorts
0.21
essays
0.21
(short
0.20
Short
0.20
oug
0.20
Activations Density 0.152%