INDEX
Explanations
full names of authors of articles, potentially from different sources
the authorship or attribution of content in a document
New Auto-Interp
Negative Logits
stood
-0.91
soDeliveryDate
-0.77
',"
-0.71
,'"
-0.67
discriminated
-0.65
').
-0.64
evidenced
-0.62
positives
-0.62
discriminate
-0.62
restitution
-0.61
POSITIVE LOGITS
Posted
1.04
Contribut
1.03
(@
1.00
<|endoftext|>
0.99
↵↵
0.98
Published
0.98
·
0.97
»
0.92
↵
0.89
âĸº
0.83
Activations Density 0.207%