Skip navigation
Skip navigation

The distribution of word matches between Markovian sequences with periodic boundary conditions

Burden, Conrad J; Leopardi, Paul; Foret, Sylvain


Word match counts have traditionally been proposed as an alignment-free measure of similarity for biological sequences. The D2 statistic, which simply counts the number of exact word matches between two sequences, is a useful test bed for developing rigorous mathematical results, which can then be extended to more biologically useful measures. The distributional properties of the D2 statistic under the null hypothesis of identically and independently distributed letters have been studied...[Show more]

CollectionsANU Research Publications
Date published: 2014
Type: Journal article
Source: Journal of Computational Biology 21.1 (2014): 41-63
DOI: 10.1089/cmb.2012.0277


File Description SizeFormat Image
Burden et al The distribution of word matches 2014.pdf720.17 kBAdobe PDFThumbnail

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  20 July 2017/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator