INDEX
Explanations
references to well-known newspapers and publications
New Auto-Interp
Negative Logits
sed
-0.87
cles
-0.76
gone
-0.75
wana
-0.74
uga
-0.72
-0.69
perm
-0.69
isable
-0.68
together
-0.66
apses
-0.64
POSITIVE LOGITS
Editorial
1.25
Newsp
1.19
Magazine
1.14
editorial
1.12
Newspaper
1.10
columnist
1.07
Editors
1.02
Dispatch
1.02
Herald
1.00
article
0.98
Activations Density 0.535%