Something borrowed: sequence alignment and the identification of similar passages in large text collections
Horton, Russell; Olsen, Mark; Roe, Glenn
Description
This article describes a simple technique for identifying lexically similar passages in large collections of text using sequence alignment algorithms. Used primarily in bioinformatics to identify similar segments of DNA in genome research, sequence alignment has also been employed in many other domains, from plagiarism detection to image processing. While we have applied this approach to a wide variety of text collections, we will focus our discussion here on the...
Collections | ANU Research Publications |
---|---|
Date published: | 2010 |
Type: | Journal article |
URI: | http://hdl.handle.net/1885/12104 |
Source: | Digital Studies / Le champ numérique 2.1 (2010) |
Download
File | Description | Size | Format |
---|---|---|---|
Horton et. al. Something borrowed sequence alignment 2010.pdf | | 199.05 kB | Adobe PDF |