INDEX
Explanations
tokens that indicate the start of new sections or paragraphs
the presence of boilerplate text or document structure markers
archaic or Spanish words
New Auto-Interp
Negative Logits
":
-0.63
(
-0.62
I
-0.62
a
-0.61
AssemblyTitle
-0.61
-',
-0.60
':
-0.59
brainly
-0.55
-0.55
*',
-0.54
POSITIVE LOGITS
Jefus
1.14
whoſe
1.13
Reſ
1.09
itſelf
1.09
Houſe
1.09
Theſe
1.04
Diſ
1.02
myſelf
1.02
ſelf
1.01
houſe
1.01
Activations Density 0.117%