INDEX
Explanations
headers and section titles in a document
HTML heading tags (h2, h3, h4)
words with long s
Negative Logits
Token   Logit
ans     -0.71
co      -0.70
de      -0.70
to      -0.69
di      -0.68
ir      -0.68
b       -0.68
col     -0.67
in      -0.66
or      -0.66
Positive Logits
Token      Logit
itſelf     1.32
juſt       1.20
greateſt   1.18
pleaſure   1.17
ſever      1.16
myſelf     1.15
ſmall      1.13
Diſ        1.12
leſs       1.11
deſt       1.11
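The "words with long s" explanation can be checked against the positive logits listed above. The following is a minimal sketch, not part of the original dashboard: the dictionary simply copies the ten tokens and values shown, and `long_s_fraction` is a hypothetical helper name introduced here for illustration.

```python
# Top positive-logit tokens and values, copied from the list above.
positive_logits = {
    "itſelf": 1.32,
    "juſt": 1.20,
    "greateſt": 1.18,
    "pleaſure": 1.17,
    "ſever": 1.16,
    "myſelf": 1.15,
    "ſmall": 1.13,
    "Diſ": 1.12,
    "leſs": 1.11,
    "deſt": 1.11,
}

LONG_S = "\u017f"  # U+017F LATIN SMALL LETTER LONG S


def long_s_fraction(tokens):
    """Return the share of tokens containing at least one long s (hypothetical helper)."""
    tokens = list(tokens)
    if not tokens:
        return 0.0
    return sum(LONG_S in token for token in tokens) / len(tokens)


fraction = long_s_fraction(positive_logits)
print(f"{fraction:.0%} of the top positive-logit tokens contain a long s")
```

Running the same helper over the negative-logit tokens gives a quick contrast, since none of those ten tokens contains the character.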
Activations Density 0.065%