Explanations
the definite article "the"
the word "the" in various contexts throughout the document
Negative Logits
Token     Logit
thood     -0.76
iffe      -0.64
leeve     -0.62
aba       -0.61
claw      -0.60
vine      -0.60
oller     -0.59
afe       -0.58
craft     -0.56
backer    -0.56
Positive Logits
Token       Logit
ses         1.17
same        1.17
longest     1.07
entire      1.07
latter      1.05
quickest    1.04
fastest     1.03
hardest     1.02
entirety    0.99
slightest   0.98
Activations Density: 0.343%