The marked tokens appear to be fragments of words that have been split across delimiters, often appearing within proper nouns, technical terms, or compound words in diverse academic and technical texts. The patterns suggest these are either OCR/text encoding artifacts, reference citations within brackets, or deliberate word segmentation where parts of a single word are delimited separately from their surrounding context.