INDEX
Explanations
phrases related to news publications
the repeated mention of a specific publication or journal
New Auto-Interp
Negative Logits
creen
-0.65
holders
-0.64
theirs
-0.62
regress
-0.62
ours
-0.60
minus
-0.60
brightly
-0.59
Pric
-0.59
dh
-0.58
cream
-0.58
POSITIVE LOGITS
osphere
0.95
istically
0.93
istic
0.91
Sentinel
0.90
ist
0.90
Editors
0.88
£ı
0.88
ism
0.86
ournals
0.86
ic
0.85
Activations Density 0.027%