Skip navigation
Skip navigation

The distribution of word matches between Markovian sequences with periodic boundary conditions

Burden, Conrad J; Leopardi, Paul; Foret, Sylvain

Description

Word match counts have traditionally been proposed as an alignment-free measure of similarity for biological sequences. The D2 statistic, which simply counts the number of exact word matches between two sequences, is a useful test bed for developing rigorous mathematical results, which can then be extended to more biologically useful measures. The distributional properties of the D2 statistic under the null hypothesis of identically and independently distributed letters have been studied...[Show more]

CollectionsANU Research Publications
Date published: 2014
Type: Journal article
URI: http://hdl.handle.net/1885/11552
Source: Journal of Computational Biology 21.1 (2014): 41-63
DOI: 10.1089/cmb.2012.0277

Download

File Description SizeFormat Image
Burden et al The distribution of word matches 2014.pdf720.17 kBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  23 August 2018/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator