INDEX
Explanations
instances where the word "In" is followed by some text
the end-of-text markers in the document
Negative Logits
.","       -0.69
','        -0.66
Ù          -0.63
swe        -0.61
..."       -0.61
.</        -0.61
thereto    -0.61
='         -0.60
Ø          -0.60
""         -0.59
Positive Logits
Conclusion      1.24
Lastly          0.99
notations       0.97
withstanding    0.93
endix           0.93
resa            0.90
odore           0.90
xon             0.89
ibliography     0.89
theless         0.87
Activations Density: 0.500%