Ranjard, LouisWong, ThomasRodrigo, Allen2024-02-222024-02-221471-2105http://hdl.handle.net/1885/313823Background In short-read DNA sequencing experiments, the read coverage is a key parameter to successfully assemble the reads and reconstruct the sequence of the input DNA. When coverage is very low, the original sequence reconstruction from the reads can be difficult because of the occurrence of uncovered gaps. Reference guided assembly can then improve these assemblies. However, when the available reference is phylogenetically distant from the sequencing reads, the mapping rate of the reads can be extremely low. Some recent improvements in read mapping approaches aim at modifying the reference according to the reads dynamically. Such approaches can significantly improve the alignment rate of the reads onto distant references but the processing of insertions and deletions remains challenging.This research was funded by an Australian Research Council Discovery Project Grant #DP160103474application/pdfen-AU© The Author(s). 2019, corrected publication 2019 Open Accesshttps://creativecommons.org/licenses/by/4.0/AssemblyAmpliconMachine learningWestern-grey kangarooMitochondrionEffective machine-learning assembly for next-generation amplicon sequencing with very low overage201910.1186/s12859-019-3287-22022-10-09Creative Commons Attribution License