INDEX
Explanations
text related to literary works and authors
terms related to literature and literary criticism
New Auto-Interp
Negative Logits
rals
-0.74
xon
-0.70
twitch
-0.64
addafi
-0.63
tnc
-0.63
nels
-0.63
manned
-0.62
Bots
-0.61
arov
-0.61
Mandal
-0.61
POSITIVE LOGITS
laureate
1.05
manuscript
0.91
manuscripts
0.90
Fiction
0.89
letters
0.89
poetry
0.89
reading
0.89
fiction
0.88
novels
0.88
poems
0.85
Activations Density 0.068%