saSNP Approach for Scalable SNP Analyses of Multiple Bacterial or Viral Genomes [electronic resource]

 0 Người đánh giá. Xếp hạng trung bình 0

Tác giả:

Ngôn ngữ: eng

Ký hiệu phân loại: 570.15 Scientific principles

Thông tin xuất bản: Livermore, Calif : Oak Ridge, Tenn. : Lawrence Livermore National Laboratory ; Distributed by the Office of Scientific and Technical Information, U.S. Dept. of Energy, 2010

Mô tả vật lý: Size: 20 p. : , digital, PDF file.

Bộ sưu tập: Metadata

ID: 262414

With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.
Tạo bộ sưu tập với mã QR

THƯ VIỆN - TRƯỜNG ĐẠI HỌC CÔNG NGHỆ TP.HCM

ĐT: (028) 71010608 | Email: tt.thuvien@hutech.edu.vn

Copyright @2020 THƯ VIỆN HUTECH