LE QUY DON
Technical University
VietnameseClear Cookie - decide language by browser settings

Pasa: leveraging population pangenome graph to scaffold prokaryote genome assemblies

Do, V.H. and Nguyen, S.H. and Le, D.Q. and Nguyen, T.T. and Nguyen, C.H. and Ho, T.H. and Vo, N.S. and Nguyen, T. and Nguyen, H.A. and Cao, M.D. (2024) Pasa: leveraging population pangenome graph to scaffold prokaryote genome assemblies. Nucleic Acids Research, 52 (3). E15. ISSN 03051048

Full text not available from this repository. (Upload)

Abstract

Whole genome sequencing has increasingly become the essential method for studying the genetic mechanisms of antimicrobial resistance and for surveillance of drug-resistant bacterial pathogens. The majority of bacterial genomes sequenced to date have been sequenced with Illumina sequencing technology, owing to its high-throughput, excellent sequence accuracy, and low cost. However, because of the short-read nature of the technology, these assemblies are fragmented into large numbers of contigs, hindering the obtaining of full information of the genome. We develop Pasa, a graph-based algorithm that utilizes the pangenome graph and the assembly graph information to improve scaffolding quality. By leveraging the population information of the bacteria species, Pasa is able to utilize the linkage information of the gene families of the species to resolve the contig graph of the assembly. We show that our method outperforms the current state of the arts in terms of accuracy, and at the same time, is computationally efficient to be applied to a large number of existing draft assemblies. © 2024 Oxford University Press. All rights reserved.

Item Type: Article
Divisions: Faculties > Faculty of Information Technology
Identification Number: 10.1093/nar/gkad1170
Uncontrolled Keywords: scaffold protein; contig; nucleic acid, algorithm; Article; bacterial genome; data clustering; Escherichia coli; gene cluster; gene dosage; gene sequence; high throughput sequencing; illumina sequencing; Klebsiella pneumoniae; microbial genome; multidrug resistance; nonhuman; pangenome; Streptococcus pneumoniae; whole genome sequencing; antibiotic resistance; article; controlled study; multigene family
URI: http://eprints.lqdtu.edu.vn/id/eprint/11143

Actions (login required)

View Item
View Item