INDEX
Explanations
phrases related to news reports
sentence endings and punctuation in the text
New Auto-Interp
Negative Logits
intending
-0.78
mechanically
-0.74
sequently
-0.72
immersion
-0.69
phenomen
-0.67
exha
-0.67
mosqu
-0.66
myster
-0.65
thous
-0.64
dimensional
-0.64
POSITIVE LOGITS
<|endoftext|>
1.24
Subscribe
1.14
POLITICO
1.10
PHOTOS
0.99
READ
0.98
WATCH
0.98
SEE
0.97
Scroll
0.94
CLICK
0.93
Photo
0.92
Activations Density 0.299%