Skip navigation
Skip navigation

Something borrowed: sequence alignment and the identification of similar passages in large text collections

Horton, Russell; Olsen, Mark; Roe, Glenn


The following article describes a simple technique to identify lexically-similar passages in large collections of text using sequence alignment algorithms. Primarily used in the field of bioinformatics to identify similar segments of DNA in genome research, sequence alignment has also been employed in many other domains, from plagiarism detection to image processing. While we have applied this approach to a wide variety of diverse text collections, we will focus our discussion here on the...[Show more]

CollectionsANU Research Publications
Date published: 2010
Type: Journal article
Source: Digital Studies / Le champ numérique 2.1 (2010)


File Description SizeFormat Image
Horton et. al. Something borrowed sequence alignment 2010.pdf199.05 kBAdobe PDFThumbnail

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator