Kmer2SNP: Reference-free SNP calling from raw reads based on matching
dc.contributor.author | Li, Yanbo | |
dc.contributor.author | Patel, Hardip | |
dc.contributor.author | Lin, Yu | |
dc.coverage.spatial | Seoul, South Korea, Virtually | |
dc.date.accessioned | 2024-01-24T00:30:31Z | |
dc.date.created | December 16-19, 2020 | |
dc.date.issued | 2021 | |
dc.date.updated | 2022-10-02T07:17:44Z | |
dc.description.abstract | SNP calling is a fundamental problem in genetic analysis and has many applications, such as gene-disease diagnosis, drug design, and ancestry inference. Prior approaches either require a high-quality reference genome or suffer from low recall, low precision, or high runtime. We develop Kmer2SNP, a reference-free algorithm that calls SNPs directly from raw reads by modeling SNP calling as a maximum-weight matching problem. We benchmark Kmer2SNP against reference-free methods, including hybrid (assembly-based) and assembly-free methods, on both simulated and real datasets. Experimental results show that Kmer2SNP achieves better SNP calling quality while being an order of magnitude faster than the state-of-the-art methods. Kmer2SNP shows the potential of calling SNPs using only k-mers from raw reads, without assembly. The source code is freely available at https://github.com/yanboANU/Kmer2SNP. | en_AU |
dc.format.mimetype | application/pdf | en_AU |
dc.identifier.isbn | 978-1-7281-6215-7 | en_AU |
dc.identifier.uri | http://hdl.handle.net/1885/311808 | |
dc.language.iso | en_AU | en_AU |
dc.publisher | IEEE | en_AU |
dc.relation.ispartofseries | 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) | en_AU |
dc.rights | © 2020 IEEE | en_AU |
dc.subject | SNP calling | en_AU |
dc.subject | Reference-free | en_AU |
dc.subject | K-mer analysis | en_AU |
dc.subject | Maximum-weight matching | en_AU |
dc.title | Kmer2SNP: Reference-free SNP calling from raw reads based on matching | en_AU |
dc.type | Conference paper | en_AU |
local.bibliographicCitation.lastpage | 212 | en_AU |
local.bibliographicCitation.startpage | 208 | en_AU |
local.contributor.affiliation | Li, Yanbo, College of Engineering and Computer Science, ANU | en_AU |
local.contributor.affiliation | Patel, Hardip, College of Health and Medicine, ANU | en_AU |
local.contributor.affiliation | Lin, Yu, College of Engineering and Computer Science, ANU | en_AU |
local.contributor.authoremail | u4269546@anu.edu.au | en_AU |
local.contributor.authoruid | Li, Yanbo, u6260133 | en_AU |
local.contributor.authoruid | Patel, Hardip, u4269546 | en_AU |
local.contributor.authoruid | Lin, Yu, u1024708 | en_AU |
local.description.embargo | 2099-12-31 | |
local.description.notes | Imported from ARIES | en_AU |
local.description.refereed | Yes | |
local.identifier.absfor | 310201 - Bioinformatic methods development | en_AU |
local.identifier.ariespublication | a383154xPUB18744 | en_AU |
local.identifier.doi | 10.1109/BIBM49941.2020.9313433 | en_AU |
local.identifier.scopusID | 2-s2.0-85100357136 | |
local.identifier.thomsonID | WOS:000659487100037 | |
local.identifier.uidSubmittedBy | a383154 | en_AU |
local.publisher.url | https://www.ieee.org/ | en_AU |
local.type.status | Published Version | en_AU |
Downloads
Original bundle
- Name: Kmer2SNP_reference-free_SNP_calling_from_raw_reads_based_on_matching.pdf
- Size: 202.51 KB
- Format: Adobe Portable Document Format