INDEX
Explanations
newspaper headlines with specific information
end punctuation marks, specifically parentheses and their usage in citations
Negative Logits
ĪĴ              -0.73
fing            -0.71
psychiat        -0.69
wana            -0.66
correl          -0.65
redd            -0.64
fronts          -0.61
phies           -0.61
axe             -0.60
finishing       -0.60
Positive Logits
Story           0.98
↵               0.91
More            0.89
Protesters      0.87
Hundreds        0.84
ARTICLE         0.83
<|endoftext|>   0.83
Thousands       0.83
Buy             0.82
Former          0.80
Activations Density: 0.057%