INDEX
Explanations
mentions of specific books
references to specific books and their titles
New Auto-Interp
Negative Logits
zzle
-0.81
outher
-0.77
nel
-0.77
airspace
-0.73
rained
-0.70
effic
-0.70
pes
-0.68
terior
-0.68
vous
-0.68
retaliate
-0.68
POSITIVE LOGITS
essays
0.97
autobiography
0.93
paperback
0.91
memoir
0.90
autobi
0.89
books
0.85
edited
0.84
catalogue
0.83
books
0.81
publishers
0.81
Activations Density 0.309%