Bayesian analysis of congruence of core genes in Prochlorococcus and Synechococcus and implications on horizontal gene transfer

Matzke, Nicholas; Shih, Patrick M.; Kerfeld, Cheryl

dc.contributor.author: Matzke, Nicholas
dc.contributor.author: Shih, Patrick M.
dc.contributor.author: Kerfeld, Cheryl
dc.date.accessioned: 2018-11-29T22:56:12Z
dc.date.available: 2018-11-29T22:56:12Z
dc.identifier.issn: 1932-6203
dc.identifier.uri: http://hdl.handle.net/1885/153436
dc.description.abstract: It is often suggested that horizontal gene transfer is so ubiquitous in microbes that the concept of a phylogenetic tree representing the pattern of vertical inheritance is oversimplified or even positively misleading. “Universal proteins” have been used to infer the organismal phylogeny, but have been criticized as being only the “tree of one percent.” Currently, few options exist for those wishing to rigorously assess how well a universal protein phylogeny, based on a relative handful of well-conserved genes, represents the phylogenetic histories of hundreds of genes. Here, we address this problem by proposing a visualization method and a statistical test within a Bayesian framework. We use the genomes of marine cyanobacteria, a group thought to exhibit substantial amounts of HGT, as a test case. We take 379 orthologous gene families from 28 cyanobacteria genomes and estimate the Bayesian posterior distributions of trees – a “treecloud” – for each, as well as for a concatenated dataset based on putative “universal proteins.” We then calculate the average distance between trees within and between all treeclouds on various metrics and visualize this high-dimensional space with non-metric multidimensional scaling (NMMDS). We show that the tree space is strongly clustered and that the universal protein treecloud is statistically significantly closer to the center of this tree space than any individual gene treecloud. We apply several commonly-used tests for incongruence/HGT and show that they agree HGT is rare in this dataset, but make different choices about which genes were subject to HGT. Our results show that the question of the representativeness of the “tree of one percent” is a quantitative empirical question, and that the phylogenetic central tendency is a meaningful observation even if many individual genes disagree due to the various sources of incongruence.
dc.format.mimetype: application/pdf
dc.publisher: Public Library of Science
dc.source: PLOS ONE (Public Library of Science)
dc.subject: Keywords: article; bacterial gene; Bayes theorem; genetic variability; horizontal gene transfer; multidimensional scaling; nonhuman; phylogenetic tree; phylogeny; Prochlorococcus; protein database; sequence alignment; Synechococcus; Aquatic Organisms; Bayes Theorem
dc.title: Bayesian analysis of congruence of core genes in Prochlorococcus and Synechococcus and implications on horizontal gene transfer
dc.type: Journal article
local.description.notes: Imported from ARIES
local.identifier.citationvolume: 9
dc.date.issued: 2014
local.identifier.absfor: 060309 - Phylogeny and Comparative Analysis
local.identifier.absfor: 060409 - Molecular Evolution
local.identifier.absfor: 060408 - Genomics
local.identifier.ariespublication: U3488905xPUB16725
local.type.status: Published Version
local.contributor.affiliation: Matzke, Nicholas, College of Science, ANU
local.contributor.affiliation: Shih, Patrick M., University of California
local.contributor.affiliation: Kerfeld, Cheryl, DOE Joint Genome Institute
local.bibliographicCitation.issue: 1
local.bibliographicCitation.startpage: e85103
local.bibliographicCitation.lastpage: e85103
local.identifier.doi: 10.1371/journal.pone.0085103
local.identifier.absseo: 970106 - Expanding Knowledge in the Biological Sciences
dc.date.updated: 2018-11-29T08:10:41Z
local.identifier.scopusID: 2-s2.0-84908084037
local.identifier.thomsonID: 000330244500036
dcterms.accessRights: Open Access
Collections: ANU Research Publications
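
The abstract describes computing average distances between trees within and between "treeclouds" and asking which treecloud lies closest to the center of tree space. The sketch below illustrates that idea on toy data only. It is not the authors' implementation: the abstract says only that "various metrics" were used, so the clade-based Robinson-Foulds distance on rooted trees is an assumption made here, and the function names (`clades`, `rf_distance`, `mean_cloud_distance`, `centrality`), the nested-tuple tree encoding, and the four-taxon example trees are all hypothetical. The statistical test and the NMMDS visualization are not reproduced.

```python
from itertools import product
from statistics import mean


def clades(tree):
    """Return the non-trivial clades of a rooted tree given as nested tuples.

    Each clade is a frozenset of leaf names. Leaves are plain strings.
    """
    found = set()

    def leaves(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    root = leaves(tree)
    found.discard(root)  # the all-taxa clade is shared by every tree
    return found


def rf_distance(tree_a, tree_b):
    """Clade-based Robinson-Foulds distance (assumed metric, see lead-in)."""
    return len(clades(tree_a) ^ clades(tree_b))


def mean_cloud_distance(cloud_a, cloud_b):
    """Average distance over every pair of trees drawn from two treeclouds."""
    return mean(rf_distance(a, b) for a, b in product(cloud_a, cloud_b))


def centrality(clouds):
    """Map each treecloud name to its mean distance to all other treeclouds.

    A smaller value means the treecloud sits closer to the center.
    """
    return {
        name: mean(
            mean_cloud_distance(cloud, other)
            for other_name, other in clouds.items()
            if other_name != name
        )
        for name, cloud in clouds.items()
    }


# Toy four-taxon trees; these are made up and are not data from the paper.
t1 = (("A", "B"), ("C", "D"))
t2 = ((("A", "B"), "C"), "D")
t3 = (("A", "C"), ("B", "D"))
t4 = ((("A", "C"), "B"), "D")

clouds = {
    "universal": [t1, t2],
    "gene_x": [t1, t1],
    "gene_y": [t3, t4],
}

scores = centrality(clouds)
most_central = min(scores, key=scores.get)
print(scores, most_central)
```

In this toy setup the matrix of `mean_cloud_distance` values between all pairs of treeclouds is the kind of input a non-metric multidimensional scaling routine would take; real posterior samples would contain many more trees per cloud than the two used here.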

Download

File: 01_Matzke_Bayesian_analysis_of_2014.pdf
Size: 1.14 MB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated: 17 November 2022 / Responsible Officer: University Librarian / Page Contact: Library Systems & Web Coordinator