Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data

BMC Genomics
Jialei Duan, Xiuying Kong


Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in both model and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. In this study, we used 212.6 million paired-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these s...
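To make the single versus multiple k-mer distinction and the redundancy-removal step more concrete, the following is a minimal toy sketch in Python. It is not the Trinity or Trans-ABySS implementation, and it is not the procedure used in the study. The function names `kmer_counts` and `remove_contained`, the example reads, and the choice of k values are all hypothetical and chosen only for illustration. The sketch counts k-mers at several values of k (a multiple k-mer assembler would build one graph per k) and then drops contigs that are exact substrings of a longer contig, which is one simple, assumed notion of redundancy.

```python
from collections import Counter


def kmer_counts(reads, k):
    """Count every substring of length k across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def remove_contained(contigs):
    """Drop duplicates and contigs that are exact substrings of a longer kept contig.

    This is a naive stand-in for redundancy removal (an assumption for
    illustration); real pipelines typically cluster by sequence similarity.
    """
    kept = []
    # Visit the longest contigs first so shorter ones can be tested against them.
    for contig in sorted(set(contigs), key=len, reverse=True):
        if not any(contig in longer for longer in kept):
            kept.append(contig)
    return kept


if __name__ == "__main__":
    # Hypothetical overlapping short reads.
    reads = ["ATGGCGTACG", "GGCGTACGTT", "CGTACGTTAG"]

    # A single k-mer method would pick one k; a multiple k-mer method
    # repeats the analysis over several k values and merges the results.
    for k in (4, 6, 8):
        print(f"k={k}: {len(kmer_counts(reads, k))} distinct k-mers")

    # Hypothetical contigs merged from different samples or k values.
    merged = ["ATGGCGTACGTTAG", "GGCGTACG", "ATGGCGTACGTTAG", "TTTTCCCC"]
    print(remove_contained(merged))
```

Smaller k values tend to connect lowly expressed transcripts at the cost of more ambiguous overlaps, while larger k values resolve repeats better but need higher coverage, which is the usual motivation for combining several k values and then removing the redundancy that merging introduces.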



Related Concepts

Sequence Determinations, RNA
RNA, Plant
Computational Molecular Biology
Expressed Sequence Tags
Gene Expression Profiles
Wheat (Dietary)
