Something borrowed: sequence alignment and the identification of similar passages in large text collections
The following article describes a simple technique to identify lexically-similar passages in large collections of text using sequence alignment algorithms. Primarily used in the field of bioinformatics to identify similar segments of DNA in genome research, sequence alignment has also been employed in many other domains, from plagiarism detection to image processing. While we have applied this approach to a wide variety of diverse text collections, we will focus our discussion here on the...[Show more]
|Collections||ANU Research Publications|
|Source:||Digital Studies / Le champ numérique 2.1 (2010)|
|Horton et. al. Something borrowed sequence alignment 2010.pdf||199.05 kB||Adobe PDF|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.