Explanations
words related to literary or academic works
references to scholarly or influential works
Negative Logits
PID         -0.80
Bots        -0.74
Cookie      -0.68
effect      -0.66
Deploy      -0.66
Predator    -0.65
twitch      -0.64
Generic     -0.64
Breed       -0.63
Mane        -0.63
Positive Logits
ographies     1.02
ibliography   0.99
manuscripts   0.98
uscript       0.97
essays        0.94
ographer      0.91
published     0.91
books         0.89
ographers     0.89
archive       0.87
Activations Density: 0.146%