De novo assembly of a new Solanum pennellii accession using nanopore sequencing | Plant Sciences and Genetics in Agriculture



Citation:

Schmidt, M. H.-W.; Vogel, A.; Denton, A. K.; Istace, B.; Wormit, A.; van de Geest, H.; Bolger, M. E.; Alseekh, S.; Maß, J.; Pfaff, C.; et al. De novo assembly of a new Solanum pennellii accession using nanopore sequencing. Plant Cell 2017, 29, 2336-2348.

Abstract:

Updates in nanopore technology have made it possible to obtain gigabases of sequence data. Prior to this, nanopore sequencing technology was mainly used to analyze microbial samples. Here, we describe the generation of a comprehensive nanopore sequencing data set with a median read length of 11,979 bp for a self-compatible accession of the wild tomato species Solanum pennellii. We describe the assembly of its genome to a contig N50 of 2.5 Mb. The assembly pipeline comprised initial read correction with Canu and assembly with SMARTdenovo. The resulting raw nanopore-based de novo genome was structurally highly similar to that of the reference S. pennellii LA716 accession but had a high error rate and was rich in homopolymer deletions. After polishing the assembly with Illumina reads, we obtained an error rate of <0.02% when assessed against the same Illumina data. We obtained a gene completeness of 96.53%, slightly surpassing that of the reference S. pennellii. Taken together, our data indicate that such long-read sequencing data can be used to affordably sequence and assemble gigabase-sized plant genomes. © 2017 The author(s).
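
For readers unfamiliar with the contig N50 statistic quoted in the abstract, the following is a minimal sketch of how it is commonly computed: sort the contig lengths from longest to shortest and report the length at which the running sum first reaches half of the total assembly size. The function name `n50` and the toy contig lengths are assumptions made for illustration; they are not taken from the paper or its data.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L of the contig at which the cumulative sum of
    contig lengths, taken from longest to shortest, first reaches at
    least half of the total length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    half_total = sum(lengths) / 2
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths (in bp), for illustration only.
toy_contigs = [100, 80, 60, 40, 20]
print(n50(toy_contigs))
```

With these toy values the total is 300 bp, so the threshold is 150 bp; the two longest contigs (100 + 80) are the first to reach it, and the N50 is the length of the second one.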
