INDEX
Explanations
phrases indicating continuation or elaboration on a topic
repeated mentions of "Article Continued Below" phrases
Negative Logits
reb         -0.72
RAW         -0.68
Reborn      -0.64
romy        -0.63
rament      -0.62
reneg       -0.59
formation   -0.59
Noir        -0.58
icion       -0.56
undermin    -0.56
Positive Logits
Copy            0.67
↵               0.67
Crossref        0.66
idth            0.63
Continue        0.63
ached           0.62
Close           0.61
<|endoftext|>   0.61
Below           0.61
etitive         0.61
Activations Density 0.013%