Explanations
references to articles and opinion pieces in media publications
Negative Logits
lector      -0.18
publicly    -0.15
enstein     -0.15
inas        -0.15
azor        -0.14
llum        -0.14
ÙĥÙĪØ±      -0.14
Wikimedia   -0.14
enge        -0.14
bbc         -0.14
Positive Logits
article     0.34
coverage    0.29
column      0.29
articles    0.28
headline    0.27
reporting   0.26
article     0.26
columns     0.26
coverage    0.24
editor      0.24
Activations Density: 0.233%
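
The page does not say how the density figure is computed. One common reading, assumed here, is the fraction of token positions at which the feature has a nonzero activation. The sketch below illustrates that reading only; the function name `activation_density` and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations):
    """Return the fraction of token positions where the feature is active.

    Assumption: "active" means a strictly positive activation value.
    """
    values = list(activations)
    if not values:
        return 0.0
    active = sum(1 for a in values if a > 0)
    return active / len(values)


# Hypothetical data: 100,000 token positions, 233 of them active.
acts = [1.5] * 233 + [0.0] * (100_000 - 233)
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this assumption, a density of 0.233% would mean the feature fires on roughly 2 to 3 of every 1,000 tokens.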