Kmer2SNP: Reference-free SNP calling from raw reads based on matching

dc.contributor.authorLi, Yanbo
dc.contributor.authorPatel, Hardip
dc.contributor.authorLin, Yu
dc.coverage.spatialSeoul, South Korea, Virtually
dc.date.accessioned2024-01-24T00:30:31Z
dc.date.createdDecember 16-19, 2020
dc.date.issued2021
dc.date.updated2022-10-02T07:17:44Z
dc.description.abstractSNP calling is a fundamental problem of genetic analysis and has many applications, such as gene-disease diagnosis, drug design, and ancestry inference. Prior approaches either require high-quality reference genome, or suffer from low recall/precision or high runtime. We develop a reference-free algorithm Kmer2SNP to call SNP directly from raw reads, an approach that models SNP calling into a maximum weight matching problem. We benchmark Kmer2SNP against reference-free methods including hybrid (assembly-based) and assembly-free methods on both simulated and real datasets. Experimental results show that Kmer2SNP achieves better SNP calling quality while being an order of magnitude faster than the state-of-the-art methods. Kmer2SNP shows the potential of calling SNPs only using k-mers from raw reads without assembly. The source code is freely available at https://github.com/yanboANU/Kmer2SNP.en_AU
dc.format.mimetypeapplication/pdfen_AU
dc.identifier.isbn978-1-7281-6215-7en_AU
dc.identifier.urihttp://hdl.handle.net/1885/311808
dc.language.isoen_AUen_AU
dc.publisherIEEEen_AU
dc.relation.ispartofseries2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)en_AU
dc.rights© 2020 IEEEen_AU
dc.subjectSNP callingen_AU
dc.subjectReference-freeen_AU
dc.subjectK-mer analysisen_AU
dc.subjectMaximum-weight matchingen_AU
dc.titleKmer2SNP: Reference-free SNP calling from raw reads based on matchingen_AU
dc.typeConference paperen_AU
local.bibliographicCitation.lastpage212en_AU
local.bibliographicCitation.startpage208en_AU
local.contributor.affiliationLi, Yanbo, College of Engineering and Computer Science, ANUen_AU
local.contributor.affiliationPatel, Hardip, College of Health and Medicine, ANUen_AU
local.contributor.affiliationLin, Yu, College of Engineering and Computer Science, ANUen_AU
local.contributor.authoremailu4269546@anu.edu.auen_AU
local.contributor.authoruidLi, Yanbo, u6260133en_AU
local.contributor.authoruidPatel, Hardip, u4269546en_AU
local.contributor.authoruidLin, Yu, u1024708en_AU
local.description.embargo2099-12-31
local.description.notesImported from ARIESen_AU
local.description.refereedYes
local.identifier.absfor310201 - Bioinformatic methods developmenten_AU
local.identifier.ariespublicationa383154xPUB18744en_AU
local.identifier.doi10.1109/BIBM49941.2020.9313433en_AU
local.identifier.scopusID2-s2.0-85100357136
local.identifier.thomsonIDWOS:000659487100037
local.identifier.uidSubmittedBya383154en_AU
local.publisher.urlhttps://www.ieee.org/en_AU
local.type.statusPublished Versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Kmer2SNP_reference-free_SNP_calling_from_raw_reads_based_on_matching.pdf
Size:
202.51 KB
Format:
Adobe Portable Document Format
Description: