We generated a draft assembly of the quinoa genome using short reads of Illumina HiSeq2500 and long reads of PacBio RSII. We obtained short reads of 290.8 Gb and long reads of 45.8 Gb.
After assembling the short reads, further scaffolding and gap-closing were performed using the long reads. Totally, we determined 24,847 scaffolds as the draft genome sequence (Cqu_r1.0). The total length of Cqu_r1.0 was 1,087,413,657 bp and the N50 of the scaffolds was 86,941 bp. Gene prediction analysis revealed 226,647 coding sequences (CDSs; Cqu_r1.0_cds). The total length of Cqu_r1.0_cds was 190,451,495 bp.
Of these, the functions of 62,512 CDSs were annotated by BLAST analysis of the NCBI NR databases. Cqu_r1.0, Cqu_r1.0_cds and their deduced amino acid sequences (Cqu_r1.0_pep), BLAST annotations, and the results of a domain search against InterPro are opened on this website, the Quinoa Genome DataBase (QGDB).
We really hope the QGDB will be utilized as a valuable resource that can be used in efforts to reveal the mechanisms underlying useful traits of quinoa.