INDEX
Explanations
elements of press releases
New Auto-Interp
Negative Logits
ije
-0.15
ance
-0.15
ivor
-0.14
oon
-0.14
itage
-0.14
oyal
-0.14
entin
-0.14
wer
-0.14
ism
-0.14
let
-0.13
POSITIVE LOGITS
PRESS
0.22
PRESS
0.17
today
0.17
press
0.17
ilies
0.16
Press
0.15
Press
0.15
Posted
0.15
(PR
0.15
NEWS
0.15
Activations Density 0.044%