Skip navigation
Skip navigation

Statistical considerations underpinning an alignment-free sequence comparison method

Jing, Junmei; Burden, Conrad; Foret, Sylvain; Wilson, Susan


The D2 statistic is defined as the number of word matches of prespecified length k, with up to t mismatches, shared between two given sequences. This statistic finds its application in alignment-free comparisons of biological sequences. It has two main advantages over alignment-based methods for nucleotide and amino-acid sequence comparisons, such as BLAST (basic local alignment search tool). These are (i) D2 does not assume that homologous segments are contiguous, and (ii) the algorithm is...[Show more]

CollectionsANU Research Publications
Date published: 2010
Type: Journal article
Source: Journal of the Korean statistical society
DOI: 10.1016/j.jkss.2010.02.009


File Description SizeFormat Image
01_Jing_Statistical_considerations_2010.pdf827.58 kBAdobe PDF    Request a copy

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  19 May 2020/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator