Bayesian DNA copy number analysis
Loading...
Date
Authors
Rancoita, P M V
Hutter, Marcus
Bertoni, Francesco
Kwee, Ivo
Journal Title
Journal ISSN
Volume Title
Publisher
BioMed Central Ltd
Abstract
BACKGROUND: Some diseases, like tumors, can be related to chromosomal aberrations, leading to
changes of DNA copy number. The copy number of an aberrant genome can be represented as a
piecewise constant function, since it can exhibit regions of deletions or gains. Instead, in a healthy
cell the copy number is two because we inherit one copy of each chromosome from each our
parents.
Bayesian Piecewise Constant Regression (BPCR) is a Bayesian regression method for data that are
noisy observations of a piecewise constant function. The method estimates the unknown segment
number, the endpoints of the segments and the value of the segment levels of the underlying
piecewise constant function. The Bayesian Regression Curve (BRC) estimates the same data with
a smoothing curve. However, in the original formulation, some estimators failed to properly
determine the corresponding parameters. For example, the boundary estimator did not take into
account the dependency among the boundaries and succeeded in estimating more than one
breakpoint at the same position, losing segments.
RESULTS: We derived an improved version of the BPCR (called mBPCR) and BRC, changing the
segment number estimator and the boundary estimator to enhance the fitting procedure. We also
proposed an alternative estimator of the variance of the segment levels, which is useful in case of
data with high noise. Using artificial data, we compared the original and the modified version of
BPCR and BRC with other regression methods, showing that our improved version of BPCR
generally outperformed all the others. Similar results were also observed on real data.
CONCLUSION: We propose an improved method for DNA copy number estimation, mBPCR, which
performed very well compared to previously published algorithms. In particular, mBPCR was more
powerful in the detection of the true position of the breakpoints and of small aberrations in very
noisy data. Hence, from a biological point of view, our method can be very useful, for example, to
find targets of genomic aberrations in clinical cancer samples.
Description
Citation
BMC Bioinformatics 10.10 (2009)
Collections
Source
BMC Bioinformatics
Type
Book Title
Entity type
Access Statement
License Rights
Restricted until
Downloads
File
Description