Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data
| dc.contributor.author | Li, Teng | en |
| dc.contributor.author | Zou, Yiran | en |
| dc.contributor.author | Li, Xianghan | en |
| dc.contributor.author | Wong, Thomas K.F. | en |
| dc.contributor.author | Rodrigo, Allen G. | en |
| dc.date.accessioned | 2025-05-23T10:21:01Z | |
| dc.date.available | 2025-05-23T10:21:01Z | |
| dc.date.issued | 2024 | en |
| dc.description.abstract | Background: The application of Uniform Manifold Approximation and Projection (UMAP) for dimensionality reduction and visualization has revolutionized the analysis of single-cell RNA expression and population genetics. However, its potential in single-cell DNA sequencing data analysis, particularly for visualizing gene mutation information, has not been fully explored. Results: We introduce Mugen-UMAP, a novel Python-based program that extends UMAP’s utility to single-cell DNA sequencing data. This innovative tool provides a comprehensive pipeline for processing gene annotation files of single-cell somatic single-nucleotide variants and metadata to the visualization of UMAP projections for identifying clusters, along with various statistical analyses. Employing Mugen-UMAP, we analyzed whole-exome sequencing data from 365 single-cell samples across 12 non-small cell lung cancer (NSCLC) patients, revealing distinct clusters associated with histological subtypes of NSCLC. Moreover, to demonstrate the general utility of Mugen-UMAP, we applied the program to 9 additional single-cell WES datasets from various cancer types, uncovering interesting patterns of cell clusters that warrant further investigation. In summary, Mugen-UMAP provides a quick and effective visualization method to uncover cell cluster patterns based on the gene mutation information from single-cell DNA sequencing data. Conclusions: The application of Mugen-UMAP demonstrates its capacity to provide valuable insights into the visualization and interpretation of single-cell DNA sequencing data. Mugen-UMAP can be found at https://github.com/tengchn/Mugen-UMAP | en |
| dc.description.sponsorship | We thank Yuantong Ding, Xia Hua, Bui Quang Minh, Imelda Forteza, and Tianshu Yang for participating in our group meetings where these results were discussed. We also thank Michael J. Campa, Elizabeth B. Gottlin, and Edward F. Patz Jr for consultation on various clinical aspects of NSCLC and helpful discussions. We acknowledge the use of New Zealand eScience Infrastructure (NeSI) high performance computing facilities. This work was supported by the start-up funds from the University of Auckland, New Zealand to AR (4020\u201312090). | en |
| dc.description.status | Peer-reviewed | en |
| dc.identifier.other | PubMed:39333868 | en |
| dc.identifier.other | ORCID:/0000-0002-0580-6324/work/184099386 | en |
| dc.identifier.scopus | 85205336332 | en |
| dc.identifier.uri | http://www.scopus.com/inward/record.url?scp=85205336332&partnerID=8YFLogxK | en |
| dc.identifier.uri | https://hdl.handle.net/1885/733752003 | |
| dc.language.iso | en | en |
| dc.rights | Publisher Copyright: © The Author(s) 2024. | en |
| dc.source | BMC Bioinformatics | en |
| dc.subject | Clustering | en |
| dc.subject | Gene mutation | en |
| dc.subject | Single-cell DNA sequencing | en |
| dc.subject | UMAP | en |
| dc.subject | Visualization | en |
| dc.title | Mugen-UMAP: UMAP visualization and clustering of mutated genes in single-cell DNA sequencing data | en |
| dc.type | Journal article | en |
| dspace.entity.type | Publication | en |
| local.contributor.affiliation | Li, Teng; Division of Ecology and Evolution, Research School of Biology, ANU College of Science and Medicine, The Australian National University | en |
| local.contributor.affiliation | Zou, Yiran; Division of Ecology and Evolution, Research School of Biology, ANU College of Science and Medicine, The Australian National University | en |
| local.contributor.affiliation | Li, Xianghan; Genome Sciences and Cancer Division, John Curtin School of Medical Research, ANU College of Science and Medicine, The Australian National University | en |
| local.contributor.affiliation | Wong, Thomas K.F.; School of Computing, ANU College of Systems and Society, The Australian National University | en |
| local.contributor.affiliation | Rodrigo, Allen G.; Administration, Research School of Biology, ANU College of Science and Medicine, The Australian National University | en |
| local.identifier.citationvolume | 25 | en |
| local.identifier.doi | 10.1186/s12859-024-05928-x | en |
| local.identifier.pure | d5dab0e4-4c88-4858-8acb-c168cb0705d6 | en |
| local.identifier.url | https://www.scopus.com/pages/publications/85205336332 | en |
| local.type.status | Published | en |