INDEX
Explanations
references to newspapers
mentions of newspapers
New Auto-Interp
Negative Logits
ayne
-0.87
imilar
-0.77
cius
-0.72
adders
-0.71
fee
-0.70
laus
-0.70
sounding
-0.69
paren
-0.68
egal
-0.68
alos
-0.68
POSITIVE LOGITS
columnist
0.95
publisher
0.90
lisher
0.89
archives
0.88
editor
0.86
newspapers
0.85
newspaper
0.83
Newsp
0.81
publishers
0.80
reporter
0.78
Activations Density 0.018%