INDEX
Explanations
instances where "the" is used in various contexts within the text
New Auto-Interp
Negative Logits
(ed
-0.18
own
-0.17
decess
-0.14
stood
-0.14
nin
-0.14
seek
-0.13
room
-0.13
éł
-0.13
vers
-0.13
ned
-0.13
POSITIVE LOGITS
same
0.41
ses
0.39
following
0.33
latter
0.30
entire
0.28
likes
0.27
entirety
0.24
same
0.24
various
0.23
majority
0.23
Activations Density 3.576%