000 01901nam a2200229Ia 4500
003 MX-MdCICY
005 20250625153919.0
040 _cCICY
090 _aB-13265
245 1 0 _aGenovo: De Novo Assembly for Metagenomes
490 0 _vJournal of Computational Biology, 18(3), p.429-443, 2011
520 3 _aNext-generation sequencing technologies produce a large number of noisy reads from the DNA in a sample. Metagenomics and population sequencing aim to recover the genomic sequences of the species in the sample, which could be of high diversity. Methods geared towards single sequence reconstruction are not sensitive enough when applied in this setting. We introduce a generative probabilistic model of read generation from environmental samples and present Genovo, a novel de novo sequence assembler that discovers likely sequence reconstructions under the model. A nonparametric prior accounts for the unknown number of genomes in the sample. Inference is performed by applying a series of hillclimbing steps iteratively until convergence. We compare the performance of Genovo to three other short read assembly programs in a series of synthetic experiments and across nine metagenomic datasets created using the 454 platform, the largest of which has 311k reads. Genovo's reconstructions cover more bases and recover more genes than the other methods, even for low-abundance sequences, and yield a higher assembly score. Supplementary Material is available at www.liebertoinline.com/cmb.
650 1 4 _aALGORITHMS
650 1 4 _aCANCER GENOMICS
650 1 4 _aSEQUENCES
700 1 2 _aLaserson, J.
700 1 2 _aJojic, V.
700 1 2 _aKoller, D.
856 4 0 _uhttps://drive.google.com/file/d/1R3Bc_HMkTHJNhLTxJCY2wySvtbuDH-eF/view?usp=drivesdk
_zPara ver el documento ingresa a Google con tu cuenta: @cicy.edu.mx
942 _2Loc
_cREF1
008 250602s9999 xx |||||s2 |||| ||und|d
999 _c47466
_d47466