INDEX
Explanations
instances of authorship or attribution in text
New Auto-Interp
Negative Logits
ues
-0.18
Enumerable
-0.16
alone
-0.15
å¾ĭ
-0.14
quential
-0.14
orer
-0.14
zent
-0.14
夫人
-0.14
Mrs
-0.14
Sit
-0.14
POSITIVE LOGITS
staff
0.22
reporter
0.22
reporters
0.22
Staff
0.20
Agencies
0.19
Il
0.18
Associated
0.18
ron
0.17
staff
0.16
correspondent
0.16
Activations Density 0.039%