Word match counts between Markovian biological sequences

Burden, Conrad; Leopardi, Paul; Foret, Sylvain


The D2 statistic, which counts the number of word matches between two given sequences, has long been proposed as a measure of similarity for biological sequences. Much of the mathematically rigorous work carried out to date on the properties of the D2 statistic has been restricted to the case of ‘Bernoulli’ sequences composed of identically and independently distributed letters. Here the properties of the distribution of this statistic for the biologically more realistic case of Markovian...
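
As a rough illustration of what the D2 statistic measures, the sketch below counts word matches between two sequences. It is not taken from the paper: the function name `d2_statistic` and the word-length parameter `k` are assumptions introduced here, and the sketch assumes the usual convention that overlapping words are counted and that every occurrence of a word in one sequence is paired with every occurrence of the same word in the other.

```python
from collections import Counter


def d2_statistic(seq_a: str, seq_b: str, k: int) -> int:
    """Count k-word matches between seq_a and seq_b (hypothetical helper).

    Assumption: words may overlap, and each occurrence in seq_a is
    paired with each occurrence of the same word in seq_b.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")

    # Tally every overlapping k-word in each sequence.
    # If a sequence is shorter than k, its range is empty and it has no words.
    words_a = Counter(seq_a[i:i + k] for i in range(len(seq_a) - k + 1))
    words_b = Counter(seq_b[j:j + k] for j in range(len(seq_b) - k + 1))

    # A word seen n times in seq_a and m times in seq_b contributes n * m matches.
    return sum(n * words_b[word] for word, n in words_a.items())
```

For example, `d2_statistic("AAAA", "AAA", 2)` returns 6: the word `AA` occurs three times in the first sequence and twice in the second. Counting words first and then multiplying the tallies avoids comparing every pair of positions directly.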

Collections: ANU Research Publications
Date published: 2014
Type: Journal article
Source: Communications in Computer and Information Science
DOI: 10.1007/978-3-662-44485-6_11


File: 01_Burden_Word_match_counts_between_2014.pdf
Size: 643.13 kB
Format: Adobe PDF

