INDEX
Explanations
specific titles of books
references to books and authors
Negative Logits
encount      -0.65
projectile   -0.63
rouse        -0.62
whisk        -0.60
shenan       -0.59
PET          -0.59
skelet       -0.58
enforce      -0.58
combo        -0.57
quir         -0.57
Positive Logits
Versus       1.18
Matters      1.15
Without      1.15
Lessons      1.15
Secrets      1.11
Manifest     1.09
Revis        1.09
Handbook     1.09
Lies         1.08
Lives        1.07
Activations Density: 0.284%