INDEX
Explanations
phrases related to editorial content or opinions
terms related to editorial content in publications
New Auto-Interp
Negative Logits
llan
-0.74
aday
-0.72
pots
-0.71
omething
-0.71
agne
-0.70
cles
-0.69
rises
-0.68
ptives
-0.67
ulia
-0.67
imilar
-0.66
POSITIVE LOGITS
editor
1.00
ized
1.00
izing
0.99
editorial
0.99
Editorial
0.94
board
0.93
IZE
0.93
cartoons
0.91
izes
0.88
zeb
0.86
Activations Density 0.030%