The distribution of word matches between Markovian sequences with periodic boundary conditions
Word match counts have traditionally been proposed as an alignment-free measure of similarity for biological sequences. The D2 statistic, which simply counts the number of exact word matches between two sequences, is a useful test bed for developing rigorous mathematical results, which can then be extended to more biologically useful measures. The distributional properties of the D2 statistic under the null hypothesis of identically and independently distributed letters have been studied...
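As an illustration of the definition in the abstract, the following is a minimal sketch of a D2 count in Python. The function name `d2`, the word length `k`, and the `periodic` flag are hypothetical names introduced here, not taken from the paper. The wrap-around handling is one plausible reading of "periodic boundary conditions" in the title and may differ from the paper's exact formulation.

```python
from collections import Counter


def d2(seq_a, seq_b, k, periodic=False):
    """Count exact k-word matches between two sequences (illustrative sketch).

    Every pair (i, j) for which the k-word starting at position i of seq_a
    equals the k-word starting at position j of seq_b contributes 1.
    With periodic=True (an assumed reading of the title), each sequence is
    treated as circular, so words may wrap from the end back to the start.
    """
    if k < 1 or k > min(len(seq_a), len(seq_b)):
        raise ValueError("k must satisfy 1 <= k <= length of each sequence")

    def word_counts(s):
        if periodic:
            # Append the first k-1 letters so a circular sequence of
            # length n yields exactly n words.
            s = s + s[:k - 1]
        return Counter(s[i:i + k] for i in range(len(s) - k + 1))

    counts_a = word_counts(seq_a)
    counts_b = word_counts(seq_b)
    # Summing the products of per-word counts is equivalent to counting
    # all matching position pairs.
    return sum(c * counts_b[w] for w, c in counts_a.items())
```

For example, `d2("ACGT", "ACGT", 2)` counts the shared words AC, CG, and GT, while passing `periodic=True` also counts the wrapped word TA.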
Collections: ANU Research Publications
Source: Journal of Computational Biology 21.1 (2014): 41-63
File: Burden et al The distribution of word matches 2014.pdf (720.17 kB, Adobe PDF)