INDEX
Explanations
sentences that start with "This"
markers or indicators of article structure and separation, particularly the placeholder for content
New Auto-Interp
Negative Logits
onto
-0.83
units
-0.73
aires
-0.69
KNOWN
-0.68
ickets
-0.68
pots
-0.67
iations
-0.67
ials
-0.66
nets
-0.66
arson
-0.65
POSITIVE LOGITS
week
1.07
article
1.04
slideshow
1.03
morning
0.90
month
0.88
weekend
0.88
Week
0.87
concludes
0.87
infographic
0.86
column
0.84
Activations Density 0.134%